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FOREWORD 


In view of the current “knowledge explosion” accompanying the “popula- 
tion explosion,” this cycle of the Review finds a geometrically increased 
body of literature extant. The editorial policy has had to shift in certain 
directions: (a) increased selectivity in the choice of references for 
citation, with no attempt at general exhaustive bibliographies, but with 
priority for data-based or theoretically oriented papers rather than for 
descriptive, nonresearch articles; (b) reliance almost exclusively on 
research articles from well-known journals and readily available books; 
and (c) allowance for a higher proportion of evaluative and critical 
comment for fewer references considered to be highly important, while 
pointing out important trends, theoretical implications, strengths, and 
weaknesses. 

Previous issues of the REVIEW concerned with educational and psycho- 
logical testing appeared in February 1956, 1959, and 1962. These sources 
were consulted for their general organizational patterns as a guide in 
planning the present issue. The period reviewed covered July 1, 1961 
through June 30, 1964; Chapter V included some earlier references, 

Structured tests and projective techniques for personality assessment, 
which were formerly treated in separate chapters. 
single chapter. Vocational interest measurement, formerly a topic in the 
chapter on structured personality tests, was thought to merit a full chapter. 
In the planning stage, it had been hoped to devote an entire chapter to 
new technology with regard to (a) testing in conjunction with programed 
instruction; (b) improved test-scoring systems; (c) improved data- 
processing systems; and (d) automated test administration. systems, 
Although desirable, such a chapter was found to be infeasible. However, 
some of the above topics are treated within this issue. 

The Chairman gratefully acknowledges contributions by the Committee 


members, chapter authors, and his own office staff for valuable assistance 
beyond the call of duty. 


, 


» were combined into a 


SaMuEL T. Mayo, Chairman 
Committee on Educational and 
Psychological Testing 


CHAPTER I 


General Overview of Trends in Testing 


HENRY MOUGHAMIAN 


Eve . 

ca cos et e psychological testing has not experienced 
Constant appraise r " à Uni the past three years, the period was one of 
the United iie a | ev aluation. The number of tests given annually in 
trend will eee S to increase: there are no indications that this 
i side in the near future. From the results of these tests, 


inferey 

1ces sant per : s 
concerning human abilities are drawn. Although this review 

any worthwhile contributions were 


f this chapter are twofold: (a) to 
he likelihood of the attainment 
ggest contributions 


eri , 
ae The not a revolutionary one. m 
cite those, primary tasks of the writer o 
of the contributions that will enhance t 
multiple objectives of testing and (b) to su 
those devel able to criticism. This chapter also treats 
ence to e'opments that have important implications for teachers. Refer- 
doe s issues of the REVIEW. particularly those devoted to 
Which t tal and psychological testing. will give the reader a basis upon 
© evaluate the developments to be mentioned in this discussion. 


that wi 
Will make testing less vulner 


Sources of Information 


Tests and Testing 

pone of the outstanding recent contributions in the testing i was 
°s' (1961) Tests in Print. In addition to serving as a classified index 
earp a Plement to all previous volumes of The _ Lei es 
“ngli ook, it presents a comprehensive bibliography of tests av: ‘a 
“ln ish-speaking countries. Of the 2,126 tests listed in the section calle 
Ak rint Tests,” 487 are not cited in books in sape 
? Included are the contents of two manuals for improving tes dee 
ndations for Psychological ests 


est m 
anuals: : P 
als: (a) Technical Recomme joint committee at 


and " 

a, ] ; hich w ared by a : 

the A, 870s/ic Techniques, ones a eie Educational Re- 
ements Used in 


eearch Anon Peyoholagical Assort il Measur 
Sacia ional Council on ; 
ciation, and the Nation Achievement Tests, 


Ucation, - dations for 
wh} , and (b) Technical Recommen i x 
Which was rests by a joint committee of the American Educational 


€search Association and the National Council on Measurements Used in 
Ucation., 


Wer, mong the most notable texts that 
€ Measurement and Evaluation in 


at appeared during the past three years 
Education, Psychology, and Guid- 
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ance by Adams and Torgerson (1961). Essentials of Measurement for 
Teachers by Durost and Prescott (1962). Principles of Psychological 
Measurement by Helmstadter (1961). and Educational Measurement and 
Evaluation by Nunnally (1961). The text by Adams and Torgerson is 
an excellent source for graduate students. Readings in Psychological Tests 
and Measurements, edited by Barnette (1961). should provide helpful 
supplementary readings for a course in measurement and evaluation. 

Recent revisions of noteworthy texts included Psychological Testing 
by Anastasi (1961). Theory and Practice of Psychological Testing hy 
Freeman (1962), Evaluating Pupil Growth by Ahmann and Glock (1963). 
and Measurement in Today's Schools by Stanley (1961). 


Testing Programs 


Many worthwhile books dealing with the use of various scales and 
measures in testing programs were published. Goldman's (1961) Using 
Tests in Counseling, Bauernfeind’s (1963) Building a School Testing 
Program, and Berdie and others’ (1963) Testing in Guidance and Coun- 
seling were among the most noteworthy. 

Another significant contribution to the area of testing programs was 
the Sixty-Second Yearbook of the National Society for the Study of 
Education (Findley, 1963c), entitled The Impact and Improvement of 
School Testing Programs. The contributors dealt with test theory and 
practice in today's schools. Several writers included critical evaluations of 
current issues and problems of testing. Recommendations were furnished 
which should be useful to those interested in testing, especially teachers, 

In this same yearbook, Findley (1963a) presented an excellent treat- 
ment of the purposes and development of school testing. programs, and 
Traxler and North (1963) gave suggestions concerning the selection and 
use of tests in school lesting programs. If the recommendations set forth 
by these experts were applied to even a modest degree, testing programs 
and research efforts would definitely be improved. 

In the first of a series of volumes. Albright. Glennon, and Smith (1963) 
presented a pragmatic account of the use of tests by industry, Although 
not so comprehensive as other sources in this area. their volume offers 
valuable techniques that should be of interest to those in industry. Jt 


should also eliminate some of the myths concerning the use of tests in 
industry. 


Criticisms 


Tests and testing have been focal points of criticism for many years, 
Much of the criticism has been unwarranted: some has been justified. 
Among popular critics were Black (1963) 
the test publishers to those responsible fo 
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» whose criticisms ranged from 
r interpreting test results, and 


———,— 
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Ho c -— 
zs um i himself to criticism of multiple-choice items 
mel bis ane E eading test-making organizations. Gross (1962) re- 
Minden grs to psychological testing in industry. The American 
Officers. and Ses nool Administrators. Council of Chief State School 
meii M ICM Association of Secondary-School Principals, Joint 
uU etema, vb pain. (1962 ) expressed their concern for the ramifieations 
series of rena sh programs in Testing, Testing, Testing. In the first of a 
(1963) q ports on the social consequences of ability testing. Goslin 
escribed the current status of testing in the United States in 


educati t 

ation, business z : pen = 
"usiness and industry. government, and the military services. 

common element in most 


f 


Possi s 
of dereud consequences were discussed. One 
the persons ee af testing has been the contention that far too many © 
In Testing in P e tests lack the ability to interpret test data. — 
Criticisms ^ oi Sal Noll (1961) presented an extensive evaluation of 
included refer ack (1963) and Hoffmann (1962). Noll’s reply, which 
and Debhiin (1005 to Testing: Its Place in Education Today by Chauncey 
initiated by V, 63) as well as to present movements and trends in testing 
most n Enos professional organizations. constituted one of the 
Criticisms um treatments available for those concerned with answering 
offered a pe peut modern testing practices. An editorial in Science (1963) 
Presented onvincing reply to those criticizing tests. Chauncey and Dobbin 
a notable discussion of the value of multiple-choice items as 


wel t 
ipd vigorous defense of testing. . -— 
tion, Dr ual meetings of the National Council on Measurement in ;duca- 
Woithwnre (1962), Findley (1963b). and Trump (1963) Hos only gun 
steps ig perspectives of the present status of testing but also suggested 
Progr at should be taken if testing is to conn to make d 
Gear the 1963 Invitational Conference on Teme Pro D v - ia 

a eviat listed steps to prevent further misconceptions 9 testing an 
€ possible adverse social consequences of testing. 


Both pr : e e inadequate reparation of 
professional and lay critics cited the eg Nol (1961). and 


leach 
ers in tests a Bowman (1 
sts and measurements. Down? à 
sner (1961) expressed the need for increased teaches t met ^s 
is problem. a com 
of this F a test of 


Is area, R Rp [ et 
Nath Recognizing the importance s i 
b Council on Measurement in Education CoA bond ^ 
initiate, competence in educational measurement. 
Mid by the Council. Mayo (1964 ame 

© a check list containing 70 statements à 
at prospective teachers in measurement should pers Me 
* research should have important implications for teacher training. 


e work 


Developments 
Although the use of tests with televised instruction is not new. this 
“view period witnessed a trend toward the increased use of tests w ith 
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programed materials. Erickson (1962) described the functions of exam- 
inations in the measurement of outcomes of televised instruction in the 
Chicago City Junior College. A more recent development of this program 
has been the increased use of programed evaluation in televised courses. 
one example of which was a study guide prepared by Engelhart (1963). 
Heath (1962) identified the functions of measurement associated with 
programed instruction as (a) evaluation of student achievement. (b) 
evaluation of the program, (c) use in program development, and (d) 
diagnosis and guidance. He discussed several possible effects of pro- 
gramed learning on achievement test construction. 

In recent years, high-speed scoring services for large-scale testing 
programs have become available at the Psychological Corporation: Science 
Research Associates; Educational Testing Service; Harcourt, Brace and 
World; California Test Bureau; and Houghton Mifflin Company. More 
recently, high-speed electronic scoring machines have been made available 
to institutions of higher learning and to city school systems. The use of 
computers in test scoring and analysis has also grown rapidly. In addition 
to describing the ramifications of a computer-based information system 
on test scoring and analysis, Anderson (1962) discussed the facilitation 
of the work inherent in establishing norms and in producing item analyses. 
Stafford and Bianchini (1963) reported a program at San Jose State 
College that involved the use of the IBM 1620 in scoring teacher-made 
tests. These developments seem to represent a trend toward wide use of 
automated scoring devices in conjunction with computers. The extent to 
which this trend will become general is likely to be somewhat limited by 
economic factors, since present costs are high. 


Large-Scale Programs 


In the first of two reports since 1961 on th 
Flanagan and others (1962) studied various A 
They reported their findings on the div 


ely in the evaluation of characteristics 
e American high school student. The 
xtremely influential in the realization 
and evaluation functions of American 


comprehensive findings should be e 
of improvements in the guidance 
high schools. 

Stalnaker (1961) described the research con 
the National Merit Scholarship program. H 
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ducted in conjunction with 


olland and Astin (1962) 


bh 
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studi e s 
sien Ji tad ( academic. artistic. scientific. and social) of 
Theis Fate i es ad xelatlón to attributes manifested in high school. 
fhe same vat in general agreement with most previous research in 
at Tarpen m study in which more than 1,000 college 
rition tiskan h -ehmann (1963) found changes in students’ 
years, The mi : i — and values from their freshman to senior 
Maine emi changes, which occurred during the freshman and 
Sonepédasa neo, ure atille by the students mostly to informal, 
He implied tag ei ather than to the formal, academic course work. 
at identifying those experiences which contribute toward 
or nonacademic, is one of the major 


thes 
Pre RENS, whether academic 
ms of colleges and universities. 


Construction 


i bs construction of tests to measure the attainment of intellectual skills 

development of tests to measure 
ect of the problem has been one 
there is no doubt that tests can 


of d 
he difficulties has been that of 


efini "er s 
ing various intellectual processes. 


and q 

o me 
i measure such processes. One of t 
e emphasis from the measurement of mere knowledge of facts 


e e measurement of intellectual skills, especially in teacher-made tests. 
thas he eae of such objectives as values. attitudes, ie p 
Years. qu prs an obstacle to individuals in test poeni m or many 
md x hether in a planned and systematic or an une, e d 
uated lh. EN attitudes, values, and beliefs have baet py dae i 
education = volume concerned with the development o : So pe 
Masia Ts objectives in the affective domain, ipo oe etm e 
Poses, T} 64) contributed a noteworthy structure er ath er ion 
in gl «e authors provided a schema which should e valuable cen 
s vesilying objectives and in writing test items 1n the affective ii 
oliin authors pointed out, problems in the a domein PAM mA 
M ce lo terminology and overlap in categories. Neverth : i 1: 
i "S opinion that their taxonomy can and will initiate mar ed prog 


In th : » 
€ measurement of traits in the affective domain. 


Engelhart and Beck (1963) evaluated the improvement 
how to cons 


of tests over 
truct better 


th 
ey : 
iei Years and gave practical recommendations on s mh 
Sts. This section will be extremely helpful to teachers in preparing heir 
in interpreting normative 


s. and 
also be o 
aminations. 


own á 3 
lests, in selecting standardized test dE acilleralila dii to 


Cata , 
ta. Green s (1963) short volume should 


teachere ; 
Pera in the construction of their own bs y E : 
‘bel (1962) described the development of specifications for educational 


te : - 
Sts which would contribute to the design of balanced and parallel forms 
an examination. Ebel's suggestions for test construction. along with 
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those previously mentioned. will aid in overcoming what he termed — 
of the important problems in preparing an educational test: (a) rea 2 
tion of a representative sampling of content areas covered by the test. | 2 
production of parallel alternate forms. and (c meaningful description o 
the tasks included in the test. From his experience with instructional 
technology, Glaser (1963) generalized the need for establishing levels of 
competence and performance in the development of achievement. te : 
relating to learning outcomes. The need for the construction of test items 
that would maximize group di 


stinctions was also suggested. 


Administration 


The importance of following prescribed directions in administering 
standardized tests is well known. In the instance of teacher-made tests. 
the relative effect upon test performance of various factors involved in 
test administration is not quite so evident as it is for standardized tests. 
Michael and others (1963) investigated the effects of three 
directions and four types of scoring formulas upon the performance of 
examinees. They found that higher scores resulted when gu 
encouraged and that scores decreased when 
These results have implications for norms of st 
on what the directions are relative to 
experience, scores ma 
behavior rather than 


types of 


Sing was 
guessing was discouraged. 
andardized tests, depending 
guessing. Since students differ in 
y reflect a degree of sophistication in test-taking 
genuine differences in achievement. 

For students responding to items of a vocabulary test, Boag and Neild 
(1962) studied the effect upon performance of time limits conducive to 
speediness and of an unlimited time interval. They found that some stu- 
dents changed their relative class standings when they were given addi- 
tional time. Unlimited time benefi 


ted to the largest degree the average 
student who worked slowly and accurately, 
Winiewicz (1963) listed desirable 


cards instead of marked sense answe 
advantages mentioned for 
consideration. 

In an excellent account of the Los Ange 
teachers in proper 


effects of using porta-punch answer 
r sheets in achievement testing. The 


the entire testing program are worthy of 


les schools program to aid 
test administration in the element 
11902) described work leading to the 


records that were used to assure uniforr 
tions for two group standardized tests. 


ary school, Bowman 
development of two phonograph 
nity in the presentation of instruc- 


Interpretation. of Scores 


As mentioned previousl 
ing the ability of teachers 
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; there have been numerous criticisms regard- 


lo interpret test data accurately. If these well- 


a” 
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founded criticis 
ke Mr Mea m "udin e ape and if continued progress is to be 
interpretation e^ e ES Hna that marked improvement be realized in the 
In Test Sc DSE ae 
that Boula pep pini á hat They Mean, Lyman (1963) presented a text 
involved: fn. mensum : ae ner who are nol well versed in the statistics 
and their ore wem nt he text emphasizes the various types of scores 
ment. Ohlsen (a) f Hes than do most texts in educational measure- 
interpretation of “ee furnished an extensive and excellent account of the 
effective test lai scores. His recommendations are important aids in 
Ens "m : i ALOT . . 
gelharts (1962) discussion of advantages of stanines over other 
achievement should be quite 
(1962) described the use of 
The increasing use of stanines 
which are 


convert 
ed score - 
helpful : ond measurements of ability and 
O teachers ¢ i 3 d E 
Sap ichers and. counselors. Demsey 


si š " 
seems t * reporting test results to parents 
5 to favor the tre 
wor the trend toward coarser units of measurement. 


ess suscepti : 
eptible to misunderstanding. 


Measurement Theory 


Me; 
Measureme n 
nent of Change 


is of primary 


a problem that 
the degree of 


The 
^ Measureme 

surement of change presents 
ble estimate of 


Sienife. 
c — m education, Without a reliable € | 
Praisals S pe a student's or a group's performance, valid AE 
of other i viri of instruction, of materials used in —? ane 
Measurin P f dles influencing achievement nani be made. Prob lems a 
Several on d tange, edited by Harris (1963). includes p ospita y 
of dne inhians: ig of the major problems involves in t Aion 
implicatio, : or example. Bereiter (1963 F presented possi i so icr 2 
in tnt for further sarch in his discussion of three oe dien 
to the iinet of change. His chapter provides an es oaen = 
(1963) na of measurement of change- In another pessoal : 
Change: discussed two major sources of confusion x pons pent e 
sidered. regression effect and errors of measurement. ac o n: 
ge ed in evaluating observed changes are cited. and models are sug 
“ested for various ae " ditions which would bring observed 
Measuren arious experimental conditions WO 

tdi, ments more into line with true measurements 

ditional Reference: Harris (1962). 


Validity 
isal of the concept of validity. 
on of construct validity. of 
ications for achievement 
of instructional 


11 
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: i review period saw a 
its ud (1961) presented a thoroug 
elationship to content validity. and of 


teste F 
ts. Engelhart (1961) discussed ope 
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objectives in relation to validity. Findley (1961) related the use and 
interpretation of achievement tests to validity. The three preceding papers 
furnish an excellent framework from which teachers can obtain informa- 
tion useful in selecting or preparing achievement tests. 

Ebel (1961) stimulated some controversy by questioning the 
of validity. At the 1963 Invitational Conference on 
Anastasi (1964) presented an evaluation of construct. validity. decision 
theory, moderator variables, synthetic validity, and response styles in the 
measurement of test validity and in its interpretation. Since use of these 
terms is associated with relatively recent developments in measurement. 
Anastasi reserved judgment concerning their future relationship to think- 
ing about test validity. Tucker (1962) concluded that content validity is 
contingent upon value judgments of persons defining validity. 
ceding presentations, which indicate a continued conce 
lems of validity, should give direction to future 

Additional Reference: Engelhart (1961). 


concept 
Testing. Problems. 


The pre- 
rn with basic prob- 
research efforts. 


Academie Criteria and Grading 


For many years, the problem of approach in college prediction has 
been a challenging one. Bloom and Peters (1961) provided some valuable 


aling grades for 23 secondary 
within-school correlation coefficient 
school and college grades) to 0.77 (for 
- They also presented implications for 


elopment, and educational research. 
ing a sample of 608 schools, 


studied by Bloom and Peters, rnal scaling, 
se in median coefficient of correlation, 
en these studies can 
acteristics of the schools and colleges 
hese studies do not constitute 
bably do furnish 
alidity that m 


a formal 
an improved estimate 
ay be obtained through use 


Additional References : Angoff (1963) : Williams and Johnston (1963) 
Willingham (1963). 
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CHAPTER II 


Trends i 
ds in the Measurement of General Mental Ability 


JASON MILLMAN and MARVIN D. GLOCK 


The 
emphasis 5 
s during the last several years has been on the nature and 


determi -toli 

Ux ud hes intelligence rather than on the development of new 
lewd “ach gia: mental ability. In an excellent book in which he 
(1961) ti Ros m work of Piaget, Hebb. Harlow. and others. Hunt 
attacking HE bosse streams of theoretical formulations. After 
Tunt presented a i o ixed intelligence and predetermined development. 
tion of nibo nbept of intelligence ‘based on a hierarchical organiza- 
deriving oe me Id and information-processing strategies 
fepreventations = erable degree from past experience” (p. 109). These 
acts with its ae strategies change as the organism continuously inter- 
of the silicate ironment. The focus in this chapter will be upon some 
ight of the s nf Issues in the measurement of general mental ability in 

merging concepts of intelligence. 


Heredity and Environment 


Catt , 

edd ee ee improved his multiple abstract analysis-of-variance 

tween the e SRI to take into account interaction and correlation be- 

relative jr nvironment and heredity. In spite of Cattell’s claim that “the 
nportance of environmental and hereditary influences has be- 


come or 
ce " 8 . 
agreed me more a controversial topic in psychology * (p. 191), most writers 
vith Anastasi (1958) that the more important question is how 
Hayes (1962) and 


heredity 
en oe environment produce their effect. Both : 1 
attributabl ) pointed out that questions about the proportion of variance 
Population ie heredity or environment are meaningless unless the 
manipulate is specified. Furthermore. as scientists learn how better to 
importan e the environment to maximize intellectual growth, the relative 
Kind of the environment will increase (Hunt, 1961). 
1H nr (1958) differentiated between two classes of environ : 
stimuli "e those. producing organic effects. and those serving E! keit. 
© the on psychological reactions. This distinction appears to be i ge 
extensio utritive and educative effects proposed by Clarke (1962) in hus 
i P. of Hebb's (1958) concept of intelligence. l a 
(1962) arene the relation between heredity and environment, ae 
abilitie argued that the inherited part of intelligence consists m o 
cies t s but of experience-producing drives (El D’s). which are tenden- 
9 engage in activities conducive to learning. 


mental 


These learnings represent 
According to 


e m UR j -ioli 
easured aspect of intelligence- Hayes. prediction of 
17 
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i i i ig © facilitated bs 
later intelligence. especially in a young child. might be facilitated 
assessment of EPD's. 

Additional References: Honzik (1963): Kidd (196 


Constancy of the IQ 


Long-term reliability of intelligence test scores continued to be reported 
in the literature (Bradway and Thompson. 1962: Escalona and Moriarty. 
1961; Pinneau, 1961). Bradway and Thompson presented data which 
showed that vocabulary and abstract thinking abilities continue to increase 
into adulthood—an observation that supports Cattell’s (1963b) hypothesis 


that crystallized intelligence can reach its maximum long after biological 
maturity. 


When intelligence is seen as capable of modification by the environ- 
ment, nonconstancy of the IQ is considered “the most pervasive finding 
(Wallach, 1963). and any long-term stability of the IQ is accounted for 
by a constant environment and by Anderson's (1939) overl 
The overlap hypothesis suggests the constancy in a child" 
on the retention of the s 


ap hypothesis. 
s IQ is dependent 
ills and knowledge that were the b 
scores, rather than on the relation between these ez 
A major work related to the stability of measured intelligence was 
contributed by Bloom (1961). Bloom pointed out that by the overlap 
hypothesis the correlation between intelligence scores at two different times 
is a function of the percentage of development at the later time which had 
been attained at the earlier time. He calculated that. from conception to 
50 percent of his mature tage 17) 


age four, the individual develops 
8. 30 percent: and from ages 8 to 17, the 


asis of early 
tly scores and gains. 


intelligence: from years | to 
remaining 20 percent. 

Like Hunt (1961) and others. Bloom treated intelligence as a develop- 
mental concept that is modified as the individual interacts with his 
environment. The effect of the environ 
the earlier. more rapid period of development. The 
research efforts in which improved utilization. of information about the 
early environment may be made. Even though it may be difficult. 
impossible. to effect major changes in intelligence after it att 
level of stability. research is also neede 
individual may be enabled to 
effective ways. 

Additional References: Colver and 
(1961): Pringle and Pickup (1963). 


ment was seen as being greater in 


implications su 


gest 


if not 
ains a high 
d to find ways through which the 


utilize his characteristics in increasingly 


Spielberger (1961): Levinson 
Nonintellectual Factors 


The point of view that sees intellectual development 
how the individual interacts with his e 
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as determined by 


nvironment suggests the increased 
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importanc 
ance of research i shi 

arch in which so-called nonintellectual factors are 
. There is a need to study the 
ate or to hinder the acquisition 


relat i 
voies to performance on intellectual ta 
a tonal conditions which serve to facilit 
eo of intellectual skills 
nehtsman 962 d "a 

(1962) reported that anxious subjects did score signifi- 
of intelligence. 
ests under 


cantly lower the 3 
Gere ae did nonanxious subjects on measures 
the conditions in n d esi uie m i eec 
unimportant. On the otl s jac been told that the test was important or 
performed less well M rer hand. subjects low in achievement motivation 
when they ae when they thought that the test was important than 
Bettveon Meng a the test was not important. Triple interactions 
performance Si tied ‘areas and task importance on test 
an seen “ported, 
degree eed (1961) noted an absence of a linear relationship between 
lhe highly pia and degree of failure on increasingly more difficult tasks. 
Presented an paa l Subjeca had a strong drive to achieve until the task 
which suggest e) threat. These findings were at variance with past results 
of IN us am an inverse relation between degree of anxiety and quality 
Katz (196 n in complex learning situations. 
difficult intelle Ere a study of his which indicated that. especially with 
White than with N ly oriente tas Negro students perform less well with 
us which reveale ae administrators. Katz referred to another study of 
students Mi BS n nat when there was some incentive to succeed. Negro 
compared to ips hetei on intellectual tasks in which they" were to be 
e compared 4 Cae sample than on those tasks in which they were to 
Neither cha national sample. 

9n the d ewe nor this chapte € 
intelligence TI Pelien factors in the envi n f 
indicative of Ü ne study of how one can best manipulate the intellectual 
ieir general psychological ur ance ( Humphreys. 1962). 


aspects j 
of the environment to help children dev elop their mental abilities— 
the future (Bruner. 1964). 


lo unlock 
Aging their capacities—is a challenge for 
(1962 tonal References: Cattell and Scheier (1961): Guertin and others 
) : Silverstein (1961). 


r has been concerned with research 
ronment on the growth of 


import 


Factor Analyses of Intellectual Abilities 

llect continues to receive 
] workers. Humphreys 
of abilities in the Vernon- 
be in concert with the 
(1963b) continued 
gence which were 


he structure of inte 


Gui E 
ilford's (1959) model of tl 
and educationa 


Wide ; z 

| ta elton among psychological nd c 

fi a hierarchical organization 

recent der oe Such an organization € a 

lo explor. i gaa concepts of intelligence. am A 

rived M lis notions of fluid and crystallizec intelli 
rom factor analytic models. 

f e ago that 


Gui 

ilford’s bs l 
lford’s (1956) hopes of almost a decad 

to their factor composition 


de 
intelligence tests 


and that tests of 
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general intelligence would include more emphasis on productive thinking 
seem not to be realized. The low coefficients of correlation among Guil- 
ford’s tests (Guilford, 1964) suggest that they are measuring highly 
specific abilities. Their low generality in the test battery is most likely 
indicative of their general psychological unimportance (Humphreys, 1962). 
The usefulness of factorially pure tests to predict complex criteria has not 
yet been demonstrated. . 

Perhaps consonant with a developmental concept of intelligence is the 
apparent renewed interest in the factor analysis of tests of ability for 
different groups (e.g., Eysenck and White, 1961; Green and Berkowitz. 
1964; Mukherjee, 1962; Taylor, 1964). In Tucker's (1963) model for 
the factor analysis of three-way matrices are possibilities for the analysis 
of relations between ability measures recorded for the same people on 
several occasions. 

Although some factor analysts write as if they believe that factors are 
mental phenomena, factors are not entities of the mind. Factors serve 
primarily as categories for classifying existing tests of ability. Thus, models 
of intelligence based on factor analysis will be of limited usefulness in the 
description of the intellectual organization which is now 
research concerning the nature of intellectual activities unti 
on these new intellectual processes are developed. 


emerging in 
l tests based 


New Approaches and Tests 


One of the intriguing aspects of the work of Piaget—clarified and sum- 
marized by Flavell (1963) —is Piaget’s belief in a fixed order in which 
successive structures of behavior and thought become manifest. This 
ordered schema suggests a possible ordinal scale of intelligence in which 
rate of development can be measured by the time elapsing between what 
Hunt (1961) called the “landmarks of transition” in development. 


’s thinking, several tests have 


ct a scale better adapted to the 
elligence was that of Laurendeau 
à standardized test of the child's 
Y; contains a highly structured interroga- 


and Pinard (1962). The scale, which is 
conception of reality and causalit 
tion format. 


For some investigators (e.g., Wallach, 1963), the boundary between 
thinking and learning is an arbitrary one. Carroll (1963) defined the 
aptitude for a task as the time it takes one to learn the task. Sorenson 
(1963) employed this learning rate concept and suggested advantages of 
using the amount of time spent in doing programed materials as a measure 
of intelligence. In accord with the emphasis on potential for learning, 
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several writers (e.g. C 
ady (Ega ampbell, 1963; Vygotski : 
ie oo students in the ee wi ids i 
is a basis fee ea HEC ssary to solve intellectual 
educability. etermination of an improved measure of 
greemer à 
biased AU d. yes reached on the appropriate role of culturally 
group tests of dee m s g general mental ability. The belief that current 
in large part res gence are biased against the culturally deprived was 
tests in New = ag for the 1964 decision to ban group arel zece 
The tl bag schools (New York City Public Schools, 1961). 
10636: Riessman, d = genler use of culture-fair tests (e.g.. Cattell. 
that these tests i 32) needs to be considered in view of the likelihood 
culture will i de comparison with tests reflecting experiences within the 
differences, o diem well those behaviors dependent upon cultural 
gence doe ore, one can argue that since he grow i i 
the e rr WEN in a vacuum but is nourished aem As 
he trend iier qe on tests should not be ignored. 
traditional a abbreviated and less complex achievement-oriented 
(1963) sele > of general mental ability continued. Thompson and Finley 
Children nis the subtests from the Wechsler Intelligence Scale for 
gifted oen i would best predict full-scale IQ. For a sample of 151 
0.84 with th ary school children, the scores on their short form correlated 
score tec e ll 1Q scores based on the full scale. The predicted 
the cases reer four 1Q points from the actual score in 74 percent of 
ndings ie om eight points in all but 2 percent of the cases. Positive 
retarded pu "s from using the same procedure with educable mentally 
"e a une (Thompson and Finley. 1962). 
With ene performed an item analysis of the Progressive Matrices 
ndings jio SOUS BEOUDS of highly intelligent university students. His 
So that the Partel him to recommend that some of the items be dropped 
uring a pot nna would become a power test rather than one meas- 
items, Woli bination of speed and power. Through employing selected 
usefulness nf mh and Bachelis (1960) found that the reliability and 
cale could ] M original verbal scale of the Wechsler Adult Intelligence 
be maintained for 100 pa age from 16 to 72. 


ther i 
© ives à : 3 
nvestigators found substantial relationships between scores on 
e general measures of intelli- 


Short y 
vo nd 
gence HET tests and scores on the mor 
mmons and Ammons. 1962b: Miner, 1961). The Quick Test 
e forms, each con- 


develo 
D y 
eloped by Ammons and Ammons (1962a) has thre! 
f four pictures. Correct solution 


anino 
of B om vocabulary words and one set 0 i t solutio 
Meaning m consists of matching à word with the picture depicting its 
& Mental age norms are available for chronological ages 2-18. 
orms were published for 


atio 10° 
IQ's were suggested. IQ and percentile n 
countries. Joshi (1963) 


adults, e 
A 
nu p " : 
mber of new tests were developed in foreign 
al ability in the Hindi language. 
ted as a general 


Constr 
l ob si a verbal group test of ment 
jects exhibited high loadings in what was interpre 
21 


tients ranging in 
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(g) factor. The first foreign-language version of the Scholastic Aptitude 
Test (SAT) of the College Entrance Examination Board has been com- 
pleted in Spanish (Dieppa. 1961). “Equivalents” for the American SAT 
are not yet available. : 

In the area of the handicapped. Jones (1960) developed a tactual intel- 
ligence scale for the adult blind. 

Harris (1963) revised the scoring system of the Goodenough Dravw-a- 
Man test in the hope of extending the Goodenough scale to include the 
adolescent years. He also developed an alternate form in the figure of a 
woman, Kennedy and Lindner (1961) provided norms for Negro subjects 
on the Draw-a-Man test. 

Additional References: Duncanson (1964); Lucier and Farley (1961). 


Conclusion 


In this chapter, emphasis has been placed on an emerging concept of 
intelligence. In this concept. the structures of general mental ability. 
are partly developmental, are characterized by general strategies for 
processing information. The environment of the young child, 
his educational experience. is seen as playing a k 
ment of intelligence. Intelligence tests should serve 
those who can control the environment as well as to predict future. intel- 
lectual development (Ebel. 1961). A convergence of the measurement of 
general mental ability and learning. with emphasis on 


rate of learning 
rather than on asymptotic behavior. is seen to be a likely trend. 


which 


especially 
ey role in the develop- 
to direct the efforts of 
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CHAPTER III 


T " 
rends in the Measurement of Special Abilities 


PHILIP R. MERRIFIELD* 


In a ti ; 
dion of Ue some proclaim the specialist and others seek a current 
ing id the area pr E man, the objectives of those individuals work- 
in Bir nos ax 4. blur measurement are challenged. It is not enough 
be able to det S tau. 1n assigning a labeled number to a person; one must 
it should bs i what kind of a number, why it is desired, and when 
will find seen oyed. The scope of this chapter is restricted: The reader 
Notations ee i of general intelligence per se; Nor will he find 
Rather, hi garding either motor skills or nonaptitude traits of personality. 
ien (a) ok will be directed toward markers on the following 
evidence f theoretical issues that concern measurement workers, (b) the 
rom validation studies involving special abilities, and (c) new 


tests waiting to be used. 


Theoretical Concerns 


I ; 
n a recent article, Cattell (1964) raised a host of problems concerning 
of discourse can but 


validi 
s ani reliability. This never long quiescent area 0 € 
rom hi rom his insight into kinds of validity and kinds of relia a 
sis, h us well-known formulation leading to several kinds of factor mip 
lens stipulated the following validities. in the description of which the 
Psych pum to degree of abstraction and to immediacy of Lara M to 
dis ogical trait: (a) concrete-direct, (b) concrete-circumstantiah (6) 
ew ptual-direct, and (d) conceptual-circumstantial. Instead o relial ility, 
tran =, consider consistency, in terms of reliability, a ane a 
ten Further, he mentioned that in many cases Por ty 
impr not be maximized, but rather optimized, in order wa sh 
hie oved combination of validity and reliability in factor sc es. Lene 
nately, that argument is imbedded in Cattell’s preference for i: = 
ea and multitrait descriptions of factors. It seems to the mà E 
^ a a very large number of descriptive variables, it haha a As 
t Ick to specify one combination of them rather precisely; insteac, © 
Tick seems to be to identify a trait with as little overlap of descriptive 
Variables as is logically possible and to hope that the data support the 
differentiation, Tt may a and validity can be jointly 


maxi be that homogeneity i 
aximized to higher values with orthogonal than with oblique representa- 


` 
; m E System. Development. Corpora- 
tion Bis chapter was prepared during the authors tenure at System T Corpora: 


* Whose assistance is gratefully acknowledged. 
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tions of traits. However. an evaluation of the relative merits of ap 
and orthogonal approaches is too complex an undertaking for this bric 
article. , O 

În a salutary plea for awareness. Fiske and Butler (1963) directer the 
attention. of their readers to the effect of experimental conditions 2 
responses when an experimenter is obtaining measures of individua 
differences. They noted how the attitudes of the examiner for aptitudes— 
helpful, supportive— differ from the attitudes of the examiner for person- 
ality—noncommittal. cold. perhaps even a little disdainful. That the re- 
sponses in the two situations differ. at least in affect, is no surprise. Makers 
of tests, especially measures of aptitude. should heed the dangers of telling 
the examinee too little. Along the same line. Gerlach and others (1961) 
demonstrated that orienting examinees to do ori 


ginal work or to give 
remote associations produced almost the same level of response, relative 
to control groups. as had been induced by other experimenters through 
the use of brainstorming training. Finally. in an intriguing study. French 
(1963) reported differences in the factor content of relatively well-estab- 
lished tests of ability between a group using a systematic problem-solving 
approach and a group using a scanning approach to the test items. 

In a fundamental paper. Keats and Lord (1962) 
tion of true scores from mental tests and den 
of-fit to actual distributions that the most plausible distribution is the 
negative-hypergeometric function. As the shape of this distribution. like 
the more familiar binomial, can be changed by a 
à parameter—unlike the normal distribution. wh 
shaped—the following question may be 
shape of distributions of me 
in populations of intere the true measures of some 
abilities are skewed. Perhaps the observed distributions of IQ measures 
and of other broad-gauge measures of mind are normal by courtesy of the 
central-limit theorem. not by reason of what may be called a rule of 
Quetelet. é 


discussed the distribu- 
onstrated through goodness- 


gning a new value to 
ich always remains bell- 
raised: What is the appropriate 
asures of special abilities? It may well be that. 
st to psychologists. 


Two recent papers by Guilford (1961 
advances in the study of special 
Guilford (1961a) included refere 
informational transfer: this the 
thing in common with 


a, b) represented noteworthy 
aptitudes. In his chapter on creativity. 
his new theory of learning as 
ory seems to the reviewer to 
Brentano's act-psychology, 
it incorporates the heretofore i 
response (S-R) and the Gest 
a theory would have to be vague. let him be assured that such is not the 
case. The structure-of-intellect formulations are not only relatively precise 
but also supportive of the deduction of testable hy potheses regarding the 
nature of thought. For his article on zero correlation uilford (1961bà 
examined the frequencies of values of correlations from 13 studies in his 
series of factor analytic investigations of aptitude. He found that nearly 
20 percent of the coefficients were close enough to zero that. if optimally 


26 


neces to 


have some- 
while at the same time 
disparate explanations of the 


stimulus- 
alt schools. Lest the 


reader assume that such 
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placed, they v 
y would s d > or i 
imis cer dey B support the orthogonal representation of some 15 
study. the number typically obtained. 


Additional Reference: Milholland (1961). 


Validation. Studies 


Mii senian £e Ace 
CAR ec is divided into factorial validity and predictive validity. 
al studies will be reviewed that deal with the establishment of 
ections of tests. These 
supported the initial 
re will be a discus- 
dict useful criteria 


factor x n 

na eee of individuals or in coll 

[CRAS eia. ections differ from those which 

sion of the f 9. the factors. In the second section. the 
1e findings of researchers who attempted to pre 


fror 
n measur i 
measures of special abilities. 


Factori 7 
torial Validities 


area of verbal fluency 


Christens 
istensen and Guilford (1963) explored the 
‘lier studies 


and teste 

were mae that kinds of fluency evidenced in ear 

d hat heywotltests ids of restrictions placed on the acceptable response. 

Istence ‘of Eras not generally supported. although the separate ex- 
s s describable as word fluency, ideational fluency, associa 


tional 

uency, i : 

Peg and expressional fluency was confirmed. In a study of 

: thinking abilities, Guilford. errifield. and Cox (1961) tested 
evised forms 


ninth er; : 
of "s w a two school systems. Their battery contained rev 
of adult ee ha loaded heavily on the hypothesized factors in analyses 
cadets, Theis ; samples of relatively high ability levels—e.&- aviation 
of the ra indings not only established the factorial validity for many 
that had b E. the ninth grade level in terms of identification of the factors 
abilities MER hypothesized but also furnished the definition of neat 
structure. on tests that had been developed from the py! 2 es 
11963, i intellect model. Similarly. Merrifield. Guilford, anc = se 
thes wes M ministered a number of tests to @ sixth grade sample and oie 
These É factors as those that had been previously identified dors ults. 
perhap ast two studies seem to indicate that the Dalit PREIE font 
ate ve the dimensionality of intellect, may be essentially the gent ism 
that childhood to adulthood. This interpretation counters ihe den T 
olc B Panera factor fully characterizes infant mental ability ane nie 
Suc} the individual, the less will be the importance of the genera be! $ 
A an allegation may be tenable for children less than ]2 years o um 
"Ea notable event for those working with special abilities vae oad the 
P n of the Kit of Reference Tests for Cognitive Faclons 1 his i M Se 
Sompanying manual (French, Ekstrom, and Price. 1963) should lea 
y standardization in th dissembled fie 


Id of measuring 
Speci i 4 E 
H Cpe! n 
al abilities. It is hoped that efforts w! 


he easily 


further | be directed toward 
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increasing the number of factors for which tests are generally accepted 
by workers in the field. 


Predictive Validities 


Although the effort did not start with Getzels and Jackson (1902). their 
book has stimulated a number of investigations, 
hypotheses (a) that teachers prefer high 1Q to high creative students and 
(b) that the two groups have essentially equal achievement. When one 
considers the shortcomings of the study, as did de Mille and Merrifield 
(1962), it is not surprising that the findings were challenged, especially 
since the early resolution of issues dealing with the accommodating of 
both creative and intelligent children is of paramount importance. In his 
presentation of a clean replication of Getzels and Jackson’s study of the 
comparative effects of creativity and intelligence on high school achieve- 
ment, Yamamoto (1964a) found no significant difference between groups. 
Following Getzels and Jackson's design, Yamamoto separ 
into (a) the high IQ group—children in the top 20 per 
but not in the top 20 percent on creativity me 
creativity group—children in the top 20 perc 
but not in the top 20 percent on IQ measures, 
those of the earlier study, were of 
the total group was 118 before its 
remains that children of 
any case, whether the 

On another. tack, 
Cline, Richards, and 


especially of the two 


ated his subjects 
cent on IQ measures 
asures—and (b) the high 
ent on creativity measures 
However, his subjects, like 
generally high IQ. Since the mean of 
separation into two groups. the point 
such relatively high aptitude should do well in 
Y are creative or not, 
utilizing all the data in a correlational approach. 


Needham (1963) found th 
$ d s at a measure of IQ added 
very little to the multiple-R obtained for creativity tests as predictors of 


scien i y 
ce achievement for boys, but that there was a small increase attribut- 


rie to IQ for girls. This result does not support very well the contention 
at creativity and IQ are opposites, at least wi 
ance with high school science achiev 


In a further investigation of his 
within the high creativity group 
of children with IQ's greater th 
children with IQ's between 135 a dl 

3: 20, 
less than 120, had sien; " s à A 
pect that, giv 


found that. 
the achievement 
different from that of 
at a third group. with IQ's 
ent scores, One would sus- 

ede E IQ dp than 120, the possession of creativity is 
ement. Comparing elementary school level children in 

the same study, Yamamoto found differences in 
differences in IQ. Perhaps the tests had a more ap 


entary than for the secondary level children 


a contention which they did 


" lo arise by easy inference— Richards. 
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Cline, and Needham (1964) maintained that Getzels and Jackson’s instruc- 
tions to teachers were heavily biased toward IQ criteria. They neatly made 
their point by asking teachers to rate children on creativity. They found 
no difference in preference by teachers that could be related to creativity 
or IQ measures. just as there had been no real difference in Getzels and 
Jackson’s study in teacher preference for students in the high IQ groups. 

In a comparative study of convergent and divergent thinking, Klaus- 
meier and Wiersma (1964) reported that girls scored lower than boys on 
convergent, but higher on divergent, thinking tests. This result is con- 
sistent with findings of many other investigators. Children in grade 7 did 
better than those in grade 5 in both kinds of tests. Children in a large city 
did as well as their small-town counterparts on convergent thinking but 
less well on divergent thinking; this last result is intriguingly reminiscent 
to the reviewer of the old claim that most scientists come from rural areas. 
The study might have been more concise had the investigators obtained 
factor scores or even kind-of-thinking scores for comparison, instead of 
presenting many comparisons in which the tests are, at least by definition, 
correlated with tests in other pairs. The significance tests for such pairs 
are not really independent. . n 

Using ini modeled after those factored by Guilford and his associates, 
Jones (1964) obtained a multiple-R of 0.67 for managers’ ratings of 
creativity of industrial chemists, predicted from measures of logical reà- 
soning, mathematical reasoning, ideational fluency, originality, and five 
attitudinal variables. In a study of seventh graders, Merrifield, Gardner, 
and Cox (1964) found that teachers’ ratings of what the si gl ueni 
"problem-solving syndrome," which was made up of Sas separately 
rated traits, was predicted to a significantly greater degree d a pose 
of IQ scores, aptitude scores, and temperament trait scores than 1 M 

Scores alone. 

Petersen and others (1963) observ 
abilities appeared to contribute to o 
general mathematics at the ninth grade lev pa rom 
the two g i "urthermore, the addition of measures fr 
fena pong crga the structure-of-intellect xq er un 
—to measures from standard tests increased the malRple- or alg 
achievement but not for general mathematics achievement. tasks designed 

Merrifield and others (1962) showed that p _—— of the 
to measure problem solving could be predicted in m 4 pace wm 
abilities called conceptual foresight, sensitivity to pro ems, " 


Additional Reference: Super and Crites (1962). 


ed that a larger number of separate 
s in algebra than to success in 
el. The patterns of abilities for 


New Measures 
reviewer i r lly 
i i d ; that the reviewer is personally 
The reader will have recognized by now 


involved in much of the recent research concerned with special abilities. 
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Accordingly, and with a modicum of apology. this section begins with 
citations of several studies stemming from Guilford’s laboratory. Guilford 
and Merrifield (1960) presented a detailed formulation of the structure 
of intellect and offered suggestions for its use in the development of new 
measures of ability. Subsequently. Guilford and others (1961) reported 
the discovery of factors in cognition and convergent thinking dealing with 
symbolic material—i.e., numbers. letters. other arbitrary signs. Using the 
same battery for adults as has been employed for ninth graders. Gershon. 
Guilford, and Merrifield (1963) found evidence for several factors in the 
area of divergent thinking with figural material space forms, shading. 
and other perceptions not having familiar meanings. Guilford and others 
(1962) reported their initial efforts in developing tests to measure abilities 
in the area of behavioral (social) intelligence—the ability to think about 
the thoughts and feelings of others. Results since 1960 were 
by Guilford and Hoepfner (1963). 

Elsewhere, Stott and Ball (1963) carried out an extensive investigation 
of interitem correlations of infant and preschool tests, I 
showed rather clearly that each of the different. tests 
factors and that there was little concordance of one test with another. It 
is not surprising that the degree of prediction of Binet-type IQ measures 
has been so low. Since the factors differ. there is little opportunity for 
correlation to ensue. 

Lesser, Davis, and Nahemow (1962) 
in elementary school children of high general ability. The test is said to 
call for eduction of scientific principles implicit in early items: administra- 
tion is individual, with oral response being the rule. 

Nunnally, Flaugher, and Hodges (1963) devised an interesting test to 
measure whether the respondent tends to ey 
gorize. Reasonably high estim 
that the test will measure an 
in academic or vocational pl 


summarized 


"actor analysis 
contained several 


presented a test of scientific talent 


aluate. to denote, or to cate- 
ates of reliabilities were reported, It may be 


attribute, knowledge of which will be helpful 
acement. 


Summary 


What, then, are the trends in the me 
sees. for the immediate future. the 
onally depicted measures of separate abilities. probab 
of the structure of intellect. The issues of v 
distribution will receive greater a 
development. 


asurement of special abilities? One 
continued active development of orthog- 
ly within the plan 
alidity, reliability, and score 
ttention, especially in the area of test 


It is near the time when teachers must f. 
children whose abilities are demonstrably 
be enough to enhance those abilities alre 
an approach could lead to a nation of 
outside their specialties. Educators must 
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ace the problem of training 
different in detail. It will not 
ady emergent in the child—such 
specialists unable to communicate 
attend also to the development of 
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abilities in Sidi c 
an adulr Miser ues has nöt become aware of his potential. so that as 
in an unstable world Berl : Meinen of coping with changing requirements 
man was his ability te laps k je major characteristic of the Renaissance 
linen amher. n ups with his changing environment. albeit some- 
well om n a we is a groundswell. not yet strong enough to be 
live to individual i mii that programed learning performance is sensi- 
es im well us E differences; perhaps programs to enhance special abili- 
Dethe bei bo i facts will be developed. 
conċèin wiih Ces sum of appropriate criteria. there will be. as always. 
analen uf a tanion problem. It may be fruitful to carry out logical 
rather than in rains 3n terms of known and hypothesized special abilities, 
terion measure eume of segments of the composite task on which the cri- 
well re is typically based. With the availability of computers as 
s for use as predictors, one may expect a 
s—multiple-regression, discrimi- 


alysis. profile < s ^ 3 
) profile analysis—appropriate to the requirements of multi- 
ent of special abilities. 


d problems i , 
Man's reris involved in the measurem 
s thinking. like hi : : 
nking. like his love. is a many-splendored thing. 


as did 
contin 2 promise of new te 
: ued increase i : 

nant an ase in multivariate analysi 
facete, 
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CHAPTER IV 


Trends in the Measurement of Educational 
Achievement 


HAROLD F. BLIGH 


Conditions in American education since Merwin and Gardner prepared 
their chapter concerned with tests of educational achievement in the 
February 1962 issue of the Review have been favorable to the develop- 
ment of systematic research in the field of achievement testing. k 
In its provision of funds and materials available for research, the Na- 
tional Defense Education Act focused attention on the need for controlled 
studies that could be replicated. In addition, grants made available by the 
U.S. Department of Health, Education. and Welfare and by other govern- 
mental and private agencies served to encourage the initiation and com- 
pletion of systematic and intensive research. The availability of 


accurate. 
economical central-scoring and d 


ata-processing facilities opened up to test 
publishers as well as to other test developers opportunities to enlarge 
samples and to execute studies involving detailed e 

During the past year, two new journals appeared. 
of Schutz (1964), the Journal of Educatio 
cation of the National Council on Me 


original research and reports of applications of measurement. Published 
by the American Educational Research Association and edited by Davis 
(1964), the American Educational Research Journal furnishes reports of 
experimental and theoretical studies in education. The inclusion of the 
“Validity Studies Section” (Michael. 1953) in Educational and Psycho- 
logical Measurement served to encourage public 
performance of achievement tests at loc 
tions. It is hoped that more pe 
level will design their studies wi 
one of the several available me 


xperimental designs. 
Under the editorship 
nal Measurement, official publi- 
asurement in Education. publishes 


ation of reports of the 
al levels in practical testing situa- 
rsons carrying on research at the local 


th the view of publishing the findings in 
dia. 


Test Development 


Objectives and Content 


Hoffmann (1962) severely criticized 
employing multiple-choice items—as 
ures of factual recall. He contend discriminate against the 
brig is i 3 a aerate 
ie dn ies s e In an attempt to. demonstrate. that multiple- 
choice items can be vali s She AS 

an be valid Instruments for measuring thought, insight. and 
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representing mere 
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und i 
ee Moa Ta = snipes ieee Division of the Educa- 
of multiple-choice quien ac SORT p i gens y 
mills. S (rp yn by statistical analyses of item 
iced AE e s db uded brief discussions of the thought processes 
M E ee valuable as a guide to item writers, this descriptive 
n aod er experimental evidence that the items performed as 
pnis iei un bas and Kropp (1964) attempted to describe 
Wits E n sses is test items were intended to measure. In the 
obiil z^ tree subject matter experts independently classified 683 
oe ade a ps to an eight-level taxonomy of intellectual processes that 
The aon. rom Bloom's (1956) Taxonomy of Educational Objectives. 
Ree poe apad agreed on the classification for 61 percent of the 
pe ts : bs an additional 13 percent. one rating diverged from the 
cape S only one level. Two or more of the raters agreed on 93 
fee a the items. An attempt to determine the degree of correspondence 
idis ie intellectual process actually employed by examinees and that 

= bed by the experts provided results of limited usefulness. 

e wares tre ied of results for a comprehensive examination 
ne =~ ed according to a revised taxonomy, McGuire concluded that the 
defect each level of the taxonomy appeared to be measuring somewhat 
dea intellectual skills. Findings from factor analysis of the correla- 
dus jatrices supported the hypothesis that there isa fundamental kind 
nowledge, understanding. or intellectual skill basic to all processes. 
Nes and Kropp used two reading comprehension tests in which items 
Te distributed over the six major categories of the Taxonomy of Edu- 
cational Objectives. Agreement among judges independently classifying 
the items was sufficiently high to warrent the conclusion that raters tend 
to categorize test items with respect to the cognitive processes sought in 


the items. In general, the data supported the hierarchical structure of the 
acement of the evaluation category. 


and Wantman (1964) explored the 


feasibility of tape-recording the responses of examinees as they thought 
aloud in responding to objective test items. This exploratory study, which 
Was based on a limited sample of nine subjects, pointed up some of the 
difficulties experienced by McGuire (1963) in drawing sound conclusions 


from her findings. 


Ts E : : 
UAR with a possible mispl 
n a novel approach. Connolly 


With the advent of modern curricul ally in the fields of 
mathematics and science, test developers are faced with the problem of 
instructional goals not adequately 


o measure i t ade 
Heath (1961) attempted to identify four 


high school physics and to compare them 
Committee courses of study. 
School Physics. written 
ntion that the cognitive 
al and new science 
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courses may be expected to differ significantly. Achievement tests struc- 
tured according to traditional objectives appear to be inappropriate for 
assessing the goals distinctive to the newer curriculums. 

Mayo (1964) and Rust. Jones. and Kaiser (1962) used different ap- 
proaches to describe test content. Mayo reported preliminary findings of 
an attempt to define competences in measurement that are needed by the 
beginning teacher. Persons who were generally recognized as having the 
background to judge what the beginning teacher ought to know about 
measurement rated 70 competences expressed in behavioral terms. Collec- 
tively, the judges felt that the majority of the competences were important; 
only 2 of the 70 statements were rated “of little importance" by a ma- 
jority of the judges. Statistical concepts were considered to be least im- 
portant, whereas competences associated with criteria for selection and use 
of standardized tests and with the construction and evaluation of classroom 
tests were judged as “high.” 

Rust, Jones, and Kaiser explored the domain of critical thinking by 
factor-analyzing 21 subtests derived from Jl'atson-Glaser Critical Thinking 
Appraisal, the American Council on Education Test of Critical Thinking, 
and A Test on Principles of Critical Thinking devised by the staff of the 
Illinois Critical Thinking Project. They identified two strong factors: gen- 
eral reasoning and language discrimination. A third factor, labeled verbal 
understanding, was not clearly defined. 


Additional References: Costin (1963) ; Heath (1964) ; Hipsher (1961) ; 
Hurst, Ronning, and Bartlett (1961). 


Technical Aspects 


Research on the more technical aspects of test construction was spotty, 
with a wide range of topics being explored. In a series of four well-defined 
experiments at the college level, Brenner (1964) investigated the effect of 
different orders of item difficulty on estimates of discrimination, reliability. 


and difficulty level of four-option multiple-choice tests of educational psy- 
chology. He found that there were no significant differences for test diffi- 
culty and test reliability and that only one of the four differences in dis- 
crimination value was significant at the 0.05 level. It appeared that class- 


room instructors gained little from the time and effort spent in arranging 
items of a power test of facts and principles in order of item difficulty. 

The extent to which test scores are contaminated by the effects of re- 
sponse sets, chance, failure to attempt all items, and like factors concerned 
test theorists and test users alike. Hopkins (1964) discussed the effects of 
response styles and concomitant chance variance on the California Achieve- 
ment Tests (CAT), SRA Achievement Series, and lowa Tests of Basic 
Skills. In certain instances, a score near to or even at grade level status 
was obtainable by chance performance alone. Conventional reliability esti- 
mates for total scores on the California Arithmetic Test and for scores in the 
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Tea i 3 r; 
eae ane i be: arithmetic test in the SRA Achievement Series 
ae mea aie ae the differences in their respective extrinsic 
to the attenuation of : a riii DE ed suggested bd approaches 
wars [Bod cd seusel. 5 b ^ an hoses (a) the development at every 
adoption of dif l- est orms differing only in difficulty and (b) the 
"am n T A un scoring methods at various ability levels. 
Tont aiteeneie m study at the college level in which he used data from 
iln bition ry orms of 100 items each, Marcus (1963) investigated 
E * P apes response sets on scores of a multiple-choice 
stable nor mes . e Ha of position response sets, which was neither 
sisglieible. e js is from test form to test form and, in the main, was 
shasta lf Mar EM that a. promising line of research would be to 
a "s * effect from item to item and the effects of equalizing 
pis e the distractors. 
seslures; ro wipes concerned with the effectiveness of item analysis pro- 
applied tions “i scores on an essay test as a criterion, Myers (1964) 
Wesiilts dor : em analysis techniques to a multiple-choice test of 32 items. 
validation Bi m analysis sample with 300 cases and for three cross- 
vndinita Sac es each containing 100 cases showed the item-test biserial 
and the n wi I reliability index. the standard length of the item, 
about equall e e item validity index to the item reliability index to be 
and valid t y effective procedures in identifying items to comprise reliable 
Feld ests. 

"i ege Hall (1964) studied the sensitivity of four commonly used 
this eami ion indexes to variations in the average achievement level of 
GÍ the tx de group on which they were based. Flanagan s approximation 
rub oduct-moment coefficient, the Davis transformation of the product- 

coefficient, the Gulliksen item reliability index. and the difference 


bet 

tw . $ 

een the proportion of correct responses in the upper and lower 
hoice items in four sub- 


| ne groups were derived for multiple-c a e 
ooo for six reference populations. one of y guns d 
studied [^ independent of the achievement level o Cg manr aion 
amo 'd. The mean values of each index. which differe significantly 

ng the reference groups. tended to rise as the ability of the reference 


er i H H B 
Sroup increased. There was a marked relationship between item difficulty 
isle set of items. It would appear that 


an ne discrimination within a sing e E erm oe 

items nstructors should examine carefully the pone ing um oi 

ite at successive grade or ability levels as well as the prog" 
2. difficulties over the range covered by a test. 

ing E and Reade (1964) investigated the effects on à 

vin ifficulty levels of tests given in à required cou dt 
easurement. The index of test difficulty was expressed as the ratio 0 


Mean score to number of test items. right students who took difficult 
*Xaminations did obtain significantly higher scores than did equally bright 
Students taking easier examinations. The difficulty level of the tests was 
independent of achievement for students in the lower half of the class. 
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The authors suggested that teachers use fairly difficult examinations to 
motivate bright students during the semester and easy examinations for 
grading purposes at the end of the semester. 

Archer (1962) studied the effect on test reliability. concurrent validity. 
and item characteristics of three methods of responding to multiple-choice 
items. In comparison with a conventional approach. the two experimental 
approaches—elimination and free choice—failed to provide information 
that was either more reliable or valid. The elimination procedure was 
superior to the free choice method with respect to reliability, but not with 
respect to validity. Relative item difficulty was little affected by the method 
of response. Stability of item difficulties across procedures was noted for 
easy and difficult items as well as for items of average difficulty. Higher 
levels of item discrimination were associated with the conventional method. 
especially for easy items. 

The use of formula scoring to correct for random guessing on objective 
test items has not proved to be completely satisfactory in situations in 
which examinees’ willingness to guess differed considerably. Lord (1964) 
reported empirical findings that showed the amount of loss in test validity 
that would have resulted if each examinee had randomly guessed instead of 
omitting certain items. Test intercorrelations consistently declined from 
0.03 to 0.06 when randomly chosen responses were substituted for omitted 
responses on both tests. These findings suggest that the practice of requir- 
ing each examinee to answer every item on a test should be viewed with 
caution. 

In investigating the relationship of formula scoring to validity, Lord 
(1963) applied a mathematical model to seven hypothetical tests. For five- 
choice items, formula scoring produced a gain in validity about equal to 
that produced by lengthening the test by 20 percent or more, For two- 
choice items, the increment in validity was greater than that which could 
be produced by extending the test to any length whatsoever. He concluded 
that formula scoring appears to be relevant when (a) the examinees vary 
widely in propensity to guess, (b) the number of choices per item is fewer 
than five, and (c) the test is quite difficult for the group tested, 

Vernon (1962) carried out an investigation designed to elicit the relative 
variance associated with test content, method. and error components with 
a group of 108 British and 75 American college students. The general trend 
of the results did not support the author's initial hypothesis that certain 
weaknesses in current achievement tests, such as poor differential predictive 
capacity, may result from neglect of method of responding to a test. 
A reading test in which questions were neither seen nor answered until 
after reading of the passage was completed appeared to be measuring a 
somewhat different ability from that measured by conventional reading 
tests involving immediate comprehension. The former test was considerably 
more valid than was the latter in the prediction of academic achievement. 
Test questions to which students wrote the 
to involve an ability different from that me 
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— ine were concerned with college students" attitudes toward 
BS elm : v testing procedures. In a limited investigation involving 
T Ue i quest Feldhusen (1961) discovered that students’ reactions 
= td pem were highly favorable: post-mortem discussions of quiz 
= i us less favorably viewed. Nevertheless. three-fourths of 
a oe aat pitch discussions helped them to avoid foolish and 
a rs in subsequent testing. Using 19 pairs of college students 
iced n misil ability and factual knowledge of American govern- 
. Selakovich (1962) found no significant difference between the 
fos piven 4 and the group tested only three times a year. 

mid rim group was strongly in favor of frequent testing. 
*. n : ID 51) investigated college students’ preference for test formats 
n. Plot p of such attitudes to grades earned. Student responses 
Gibbs onto administered during the first week of the term were 
an ci si os with other test scores. Only a weak relationship 
Additional [Oc test grades and expressed preference for test format. 
Feldhuze LE Boag and Neild (1962): Calhoun (1962); 
ei Wein 1 61) J Glass and Wiley (1964); Graham (1963); Gruber 
Little Yr (1962) : Gustav (1963); Hawes and Helmstadter (1963) : 
62); Lord (1960); Myers (1962): Rimland and Zwerski 


(1962): Sax (1962); Wevrick (1962). 


3 à 
group given frequent tests 


Techniques of Administering and Scoring 


ae growing popularity of radio and television as media of instruction 
of hac, w a séries of investigations into the feasibility pi various modes 
by Grp DM. The most extensive investigation bus that reported 
Steen : and Kropp (1962). By projecting all the directions on hien 
esis pl explored the possibility of administering group standar j^ 
result a" achievement tests. the projected audio presentation Re s 
Siegen not differing from the projected paces pei 
fes oe rations. However, audiovisual presentation may "i pe à 
Adapt de subjects. visual for others. and audi l for noie ie mp y 
testi ation of computational arithmetic items to the expe rimen al mode | 
ee ing presented difficulties because es in kinds of abilities 
"e by differing techniques of test > The a woe 
=e concluded that the exposure of three items simultaneous y—one o 
average difficulty, one of more than average difficulty. and one of less than 
a HM difficulty—did yield results more similar to those obtained from 
Conventional administration than did other modes of exposure. 
, Burr (1963) studied the relationship among sets of scores earned by 
Sixth graders on the social studies test of the Metropolitan Achievement 
ests. He used three modes of item presentation that were employed in 
the foregoing investigation in conjunction with two modes of instruction— 
televised and kadinal The projected silent and audio-reinforced pro- 
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jected presentations produced results comparable to those obtained from 
conventional administration for both modes of instruction. The experi- 
mental techniques utilized less time than the conventional approach. . 

Long (1962) found supporting evidence for Burr's tentative conclusion 
that if audio reinforcement of visually projected directions does affect 
test performance, such effects will be found at a reading grade level below 
4.7 and above 8.7. He reported significant differences between the mean 
scores of educable mentally retarded elementary and secondary school 
pupils on subtests of the Stanford Achievement Test concerned with word 
meaning and arithmetic reasoning relative to whether they were examined 
in the conventional manner or given oral reinforcement of the directions, 
item stems, and response options. 

With the advent of new scoring machines in which a variety of answer 
space configurations and color designations is employed, the question of 
possible effects on answer-marking performance arises. Test users should 
rightly ask whether the norms derived with one answer medium apply to 
scores obtained with differing types of answer media. Published reports of 
research in this area were limited, especially reports in readily available 
and well-known professional journals. 

Using three samples varying widely in age and educational experience, 
Miller and Minor (1963) compared eight potential new multiple-choice 
answer forms to the standard IBM 805 answer sheet form in an attempt to 
determine whether an answer form could handicap a person's test per- 
formance and thus invalidate established test norms. Of three criteria 
studied (total right responses, total erasures, and total double marks with 
no erasures), only total right scores reflected differences that were statisti- 
cally significant and judged to be of practical importance. 
among forms were attributed both to the configurational facto 
in the grouping and to the labeling of the answer slots. Vertical grouping 
of slots and failure to label each slot with an individual number seemed 
most detrimental to answer-marking performance for all three age groups. 


Three of the experimental styles appeared to be unsuitable for use with 
tests standardized on IBM 805 forms. 


Additional research is needed on th 
and other adaptations entailed in the 
ing from that used in standardizing a test. Although results may show no 
significant differences in mean performances, consideration should be 
given to significant discrepancies which may occur at other points in the 
score scale, particularly at the lower end. 

Additional References: Gerlach and others (1964) ; 
(1962) ; Loiselle (1961) ; Slater (1964) 


Differences 
rs involved 


e possible influence of color, format, 
change to an answer medium differ- 


Kropp and others 
Reference Groups and Norms 


It has been generall 
tests differentiated wit 
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to the interpretation and use of the test results. During the past three 
years, there has been some empirical research that suggests that further 
consideration be given to the differentiation of test norms. 

_ In two well-designed studies, Bauernfeind and Blumenfeld (1963) 
investigated possible differences in educational achievement of public and 
Catholic school pupils as measured by the SRA High School Placement 
Test at the eighth grade level. Pupils matched for general reasoning 
ability, sex, and geographic region were compared on the arithmetic and 
reading tests in the first study and on the language arts, arithmetic, and 
reading tests in the second study. In the first analysis, Catholic school 
pupils earned mean grade equivalents approximately one year above those 
of pupils in public school on the two criterion tests; in the second analysis, 
the mean grade equivalent scores for Catholic school pupils were 0.45 
years higher on all three tests. All differences were significant beyond the 
0.01 level. 


Using a stratified sampling technique. Wozencraft (1963) found a real 


sex difference in favor of girls for total groups and for average ability 
groups in the third and sixth grades, when they were tested with the 
Stanford Achievement Test. At the third grade level, girls in the total and 
average groups were superior in all four areas tested; at the sixth grade 
level, girls in the total group excelled the boys in average scores in read- 
ing, arithmetic computation, and the arithmetic composite. In the average 
ability group, the significant differences exis 
Putation and in the arithmetic composite. 
Parsley, Powell, and O'Connor (1964) analyzed results on the CAT 
9r an entire school population, grades 4-8, and found that significant sex 
ifferences did exist within three achievement levels for various 1Q groups. 
n general, the results indicated that girls excelled in reading achievement 
and in arithmetic fundamentals, whereas boys tended to excel in arithmetic 
reasoning. On the basis of their findings, the authors advised against the 
Use of average scores or a composite of measures in arithmetic. 

In the Project Talent report. The American High-School Student, po 
gan and others (1964) noted the following general findings on sex re ed 
ences in achievement among high school students: (a) on a test of in = 
mation, girls in grades 9-12 did finish slightly more items on the average 
than did boys; (b) almost all the variables for which the reliability co- 
efficient was markedly lower for one sex than for the other involved subject 


Matter in which there were marked sex differences in amount of interest; 
the correlational patterns of 


€) the high d f similarity between 
A gh degree of similarity " id 
Various tests f cvested that the factorial composition 
of tests sugges x 
— di or boys as for girls; (d) 


of the tests w. ; h the same fi 
=e a hroughout the middle and 


little difference between boys and girls existed t 1 : 
in reading comprehension, 


top of the distribution of scores on a test. 2 odd 
although toward the bottom of the distribution boys scored lower than 
girls; (e) girls exceeded boys on a test in arithmetic computation at grade 
9, but by grade 12 there was little difference: (f) sex differences in a 


41 


ted only in arithmetic com- 


No. I 


REVIEW OF EDUCATIONAL Res 


ARCH Volume XX Y. 


test concerned with arithmetic reasoning and in an achievement test cover- 
ing intermediate high school mathematics were not significant. although 
by grade 12, scores for the boys were substantially higher: and (g) boys 
obtained higher scores on tests of vocabulary at all grade levels. This last 
finding might be attributed to difference in subject matter content. from 
which the vocabulary words were taken. 

Miller and Minor 11963) found that females did achieve consistently 
higher total rights-scores than did males on each of eight forms of answer 
media. They suggested that sex differences in substitution tasks such as 
those evoked might be less a function of ociation learning than of 
fundamental perceptual motor skills. Further investigation is needed before 
one may conclude that tests administered with standard answer forms 
favor girls regardless of test content. 

Norms and expectancy tables were compiled to supplement those given 
in test manuals. Wrightstone, Aronow. and Moskowitz (1963) derived 
norms on the Metropolitan Achievement Tests, Elementary Reading Test, 
Test 2: Reading from the raw scores of 5.307 pupils in 73 special schools 
for the deaf in the United States and Canada. Percentile ranks were 
reported for age groups rather than for grade groups; stanines referred 
to as Standard Reading Ratings were based on all subjects. irrespective of 
age or grade. 

Additional References: Feldmann and Weiner (1061): Hall (1963) : 
Parsley and others (1963). 


Interpretation 


Values to be gained from interpre 
tion to capacity as measured by 
Durost (1962) discussed the import 
the use of the bivariate chart in 
that the scores on the tests be e 


ting achievement test scores in rela- 
an intelligence test. were recognized. 
ance of such comparisons and stressed 
illustrating relationships. He advocated 
xpressed in units that are directly com- 
parable: that is. in terms of local stanines. Engelhart (1962) likewise 
described the advantages to be gained from applying stanines to the inter- 
pretation of achievement and mental ability measures. 

Juola (1961) analyzed ways in which one may relate several relevant 
ability measures to a criterion so that the r s 
administrators, counselors. and students. He contended that the expectancy 
table approach was particularly advantageous because the entries in the 
cells of the table appear as easily understood probability figures expressed 
in percentage forms. 

Using standardization. data for three achievement batteries and six 
independent populations at five different grade levels. Mitchell (1963) 
investigated the feasibility of applying expectancy tables based on data 
for individual pupils to the interpretation of school averages, On the basis 
of 19 comparisons of the regression line for individual pupils with that 
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Comparability of Tests and Test Scores 


t scores continued to be an enigma 


Th 

e pr m 

bd Du of comparability of tes 

dii developers and test users. Two aspects of the problem were generally 
£nized: (a) the nonequivalence of norm groups and (b) the marked 


conte 2 
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rs differences of tests intended to measure the same traits. During 
| püst three ww s : 

ast three years, several well-planned studies were concerned with at 
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Te first of these two aspects. 
fiin oe and Crandall (1962) compared the complete battery of CAT. 
ests of Basic. Skills. Metropolitan Achievement Tests. Stanford 
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Achievement Test, and the SRA Achievement Series. They used fifth and 
eighth grade groups that had been matched on a variety of factors, includ- 
ing mental ability as measured by The Henmon-Nelson. Tests of Mental 
Ability. Wide variations in mean grade placement were reported among 
the tests. In general, pupils taking.the CAT consistently earned higher 
grade placement scores. whereas those taking the Metropolitan Achieve- 
ment Tests tended to earn lower scores. 

Using the Jowa Tests of Basic Skills, Metropolitan Achievement Tests, 
and CAT, Finley (1963) attempted to make the content more nearly com- 
parable by limiting a study to reading comprehension, arithmetic reason- 
ing, and language usage. Finley’s findings supported previous results that 
CAT scores (1957 norms) did tend to run consistently higher than did 
scores on other test forms—in some cases as much as a year higher on 
the average. Differences between Metropolitan and lowa tests were non- 
significant except for language usage, for which the Metropolitan average 
was lower than the Jowa and CAT averages. 

Although recognition of theoretical and technical problems involved in 
establishing comparable scores for nonparallel tests was voiced by leading 
test theorists, little empirical information was reported in the literature. 
Angoff (1964) considered tables of comparable scores to be limited since 
their usefulness is a function of (a) the similarity of the tests for which 
comparable scores are to be developed; (b) appropriateness of the groups 
on which they are based; and (c) the amount of error that can be safely 
tolerated for the specific use under consideration. He defended the devel- 
opment of comparability tables that are derived from differentiated norm 
groups over those based on national norm or local norm groups. Lennon 
(1964) reported the development of sets of scores for end-of-course tests 
in the Evaluation and Adjustment Series, which had been rendered com- 
parable through use of an anchor lest approach. A student's score on any 
series denotes the same relative performance in a 
hypothetical population having a specified mean or standard deviation on 
the Otis Quick-Scoring Mental Ability Tests, Gamma Test. 

Additional References: Baker (1964) ; Bereiter (1964) ; Cassel (1963) ; 
Ebel (1961) ; Flanagan (1961) ; Lindquist (1964). 


Prediction and Classification 


Prediction and validation studies of the nature of those reported in the 


"Validity Studies Section" of Educational and Psychological Measure- 
ment, in bulletins of the Educational Records Bureau, and in various state 
publications contributed to improved understanding of the performance of 
specific tests with well-defined academic &roups. Numerous studies per- 
taining to a wide range of test and age levels were found. The reports of 
Anderson (1961a, b) and of Flanagan and others 
interest. 
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At the elementary level, Anderson found that the California Test of 
Mental Ability contributed little to the prediction of midterm grades in 
reading, arithmetic, language, and spelling at the fourth and sixth grade 
levels when it was used with CAT subtests, which were administered near 
the grading period. 

In an analysis of Project Talent data for twelfth grade boys, Flanagan 
and others found that those tests most directly related to high school 
subjects and those having very high components of general verbal ability 
best predicted whether a boy will enter college. Exhibiting a zero-order 
correlation of 0.52 with college entrance and the largest beta weight, a 
test of mathematics information was the best single predictor and the best 
predictor in a combination of 51 test variables. The second part of a 
mathematics test, which measured abilities of somewhat the same nature 
as those in the test concerned with mathematics information, carried the 
second largest beta weight. 

There was continuing use of achievement tests during the past three 
years for identifying and describing the characteristics of underachieving 
and overachieving pupils. Recognition was given to methodological weak- 
nesses and faulty designs in previous studies as well as to the need for 

efining operational procedures so that replication would be possible. 

Techniques for selecting underachieving and overachieving samples were 
classified and compared in two studies. Farquhar and Payne (1964) 
recognized four somewhat distinct classification techniques. Comparisons 
Were made of seven operational procedures that had been utilized in 
Published studies and of a regression procedure that had been developed 
according to characteristics of an ideal selection technique. The number 
of individuals identified as underachievers and overachievers differed 
greatly relative to the particular technique used. There was a wide sex 

ifference in the number of individuals selected by various techniques. 

Pippert and Archer (1963) found that grade point average and com- 
Posite score on an achievement test failed to identify the same ninth 
graders as underachievers. Underachievers selected by achievement test 


iteri id those 
Scores performed significantly lower on criterion measures than did thos 
grade point average. 


Students who were identified on the basis of g ; 

Outstanding theoretical contributions were made by Thorndike (20) 
aud Willingham (1964) through their attempts to clarify the construct 0: 
relative achievement and to prescribe conditions under which more aiian 
ingful research could be conducted. Thorndike reviewed existing cin 
Studies on underachievement and overachievement and neem , 
odological pitfalls arising in part from defective experimental npo rm 
in part from inherent deficiencies in the terms used to ^p designing 
achievement. He recommended precautions to be observed in gning 


future research in this area. 


Willingham defined relative achievement in ter 
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tion of relative achievement and pointed to the need for empirical research 
designed within carefully formulated frames of reference. 

Additional References: Boe (1964); Crouch (1962); Gnauck and 
Kaczkowski (1961); Haney, Michael. and Gershon (1962) : Michael and 
others (1961); Pauk (1963) : Richards and Taylor (1961) : Swanson and 
Berdie (1961) ; Wellck (1963): Yager and Dessel (1963). 


Evaluative Studies 


Only brief reports of studies evaluating new tests or revised tests were 
available. Oldridge (1961) reported on the concurrent validity of the 
reading and arithmetic sections of the Wide Range Achievement Test 
(WRAT), which was administered to seventh graders one week after the 
CAT had been given. Correlations of the scores on two subtests of the 
WRAT with scores on the arithmetic and reading subtests of the CAT 
and with total scores on the CAT ranged from 0.65 to 0.80 with a median 
of 0.72. Although the WRAT did tend to yield lower grade placement 
scores than did the CAT, the differences were not significant. Possible 
effects of practice and/or interim learning were not investigated. 

"Three studies were concerned with subtests of the Sequential Tests of 
Educational Progress (STEP). Using a random cluster sample of fifth 
grade pupils, Anderson and Baldauf (1963) investigated characteristics 
of Form 4 of the Listening Test. The coefficient of equivalence was 0.74. The 
coefficient of stability for Form 4A was 0.74; for Form 4B, 0.88. Correla- 
tion of scores on the Listening Test with total scores on the Stanford 
Achievement Test was 0.20 when mental ability was held constant. Results 
from factor analysis indicated that the factor of verbal comprehension 
seemed to account for over half of the variance. 

Gega and Karlsen (1962) studied performances of fifth and sixth grade 
pupils on the regular form of the STEP science test and on a version in 
which the items were recast in a self-contained form unrelated to the 
regular situational format. They concluded that format made no difference. 

Using a 10-percent random sample of the entire group of approximately 
9,000 students tested in the 1957 National College Testing Program. 
Fortna (1963) performed a factor analysis with the STEP and the School 
and College Ability Tests (SCAT). The SCAT verbal, STEP reading. 
STEP writing, and STEP social studies tests had the highest loadings on 
a verbal comprehension factor, whereas STEP mathematics, SCAT quanti- 
tative, and STEP science tests defined the numerical facility factor. A third 
factor, unnamed, was significantly loaded in the STEP w riting test. 

Additional References: Anderson and Slivinske (1963) ; Hopkins, Dob- 
son, and Oldridge (1962) ; Howell and Weiner (1961) : Leton, Anderson. 
and Stier (1963) ; Sheppard and Campbell (1963). 
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Conclusion 


During ; 
DAE e put thee sere ar baue Bean oie Snare 
tional achievement. Att 7 € stu ies pertaining to tests of educa- 
femilile Gore : : empts to delineate fundamental issues and to 
fruitful. Mei — or frameworks have been particularly 
planning of amal rest consideration needs to be given to precise 
im EE al estigations so that the time and efforts spent in 
n menting them will not be wasted. 
Negra of achievement tests to assess the educational goals being 
intimo d veia curtisuline offers a real challenge. especially when 
dinis wc Ag. uina is placed on development of concepts and under- 
The e din cal pam acquisition of knowledge and application of facts. 
ie hea ea a € e e the cognitive processes sought in test items appears 
anew in the right direction. Existing item types need to be refined. 
w approaches need to be developed if these newer goals are to be 


asse i 
essed in an adequate manner. 
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CHAPTER V 


Trends in the Measurement of Vocational Interest 


JOHN N. MC CALL 


pm. bi material published during the 1961-64 review period, some 
E D erences will be cited in this chapter. Lack of a previous major 
— n interest measurement as well as the writer's wish to present a 
DR cd A prompted the decision to extend the period of coverage. 
(1955) penere! reviews appeared in the volumes by Darley and Hagenah 
Do us uper and Crites (1962) as well as in the papers edited by 
ak 0). The more technical problems concerned with the con- 

ction of interest inventories were discussed by Kuder (1957), Ghei 


(1959), Clark (1960), and Layton (1960). 


Past Trends 


Accepted Methods 


manifest, tested, and inventoried 


In their distinction between expressed, 
d major ways in which applied 


nies a Super and Crites (1962) describe ; 
Sd n D ae measure vocational interests. The most widely accepted 
Der v I far has been the inventory approach. in which subjects indicate 
‘tame G terences, or relative likes and dislikes, for a long list of specific 
wale, ` riteria for selecting items and for scoring them relative to specific 
(b) ] ave been (a) conformity to standards of logic or common sense, 
Hae nomogeneity of items as reflected by high intercorrelations among 
wad ( within a cluster and low correlations with items in other clusters, 
E c) empirical differentiation, in which item responses for members of 

ected occupational groups must differ from the responses of individuals 


in s; 
T relatively general reference group. " 
he two most thoroughly researched instruments are the Kuder Prefer- 
ence Record—Vocational and the Strong Vocational Interest Blank for 


Me É à : 
en. The former illustrates homogenous keying, or scale construction, 
The ease of scoring and 


whereas the latter illustrates empirical keying. a 
eading comprehension as well as the inclusion of broadly defined activity 
areas have made the Kuder Preference Record —V ocational a popular 
instrument. Evidence which has pointed to considerable stability as well as 

to long-term predictive validity for the Strong Vocational Interest Blank — | 
(SVIB) has helped to establish it as both a research and counseling tool, 
although it is best used with persons who are oriented toward the profes- 
Sional and higher level business occupations. 
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Persisting Problems 


Finding an acceptable taxonomy of interests remains a vexing problem. 
To assume that interest scores reflect intrinsic satisfaction with given 
tasks, or occupational activities, seems to suggest an easy solution. Actually. 
there is no clear-cut, completely objective system of occupational classifica- 
tion, Existing schemata, which are quite arbitrary, often invoke use of 
overlapping criteria of classification. Many jobs with the same name differ 
considerably. Moreover, persons in a given job may utilize different psy- 
chological functions. Factor analysis has been used both to identify dimen- 
sions of interests within a large cluster of items and to find which 
dimensions best describe the intercorrelations of measures for existing 
inventories or for their subscales. In their useful summary of the factors 
obtained in several studies, Super and Crites (1962) furnished evidence 
of considerable agreement in the identification of the factors. More re- 
search is needed to give these factors psychological meaning. 

Empirically keyed scales, such as those in the SV ID, name interests 
simply in terms of the occupational group that is differentiated from a 
general reference group. Since the duties of the personnel within a given 
occupation change with time, revisions are necessary. It is also impractical 
to develop scales for all occupations. In addition. some form of ordering 
schema with existing scales is needed. Strong's method of grouping closely 
intercorrelated scales was a partial solution. However, it has been dificult 
to interpret the interests common to some of these occupational groups. 
Another complication is the possibility that any given organization of 
interests changes with age or with progress into different stages of social 
responsibility. For example, the various intrinsic and extrinsic deter- 
minants of work satisfaction and occupational choice might shift as an 
individual changes from a self-expressive to a more socially responsible 
adult role in society. 

Basic to the discussion of the preceding paragraph is Berdie’s (1960) 
point that there has been no sound theoretical basis for vocational interest 
measurement. Consequently. there is no meaningful way in which one may 
compare scores based on different measurement operations. There is also 
the difficulty of specifying the relative strength of interests. Criteria such 
as the size of a single score, or of several related scores, and the rank 
order of scores are not easily justified. This crit 
difficult for ipsative scales, such as the Kuder Preference Record—V oca- 
tional, and for the SVIB, which measures how much groups differ in spite 
of their overlapping interests. In more general terms, the profession of 
psychology is far from the point where interests can be directly translated 
into psychological processes concerned with motivation, learning. and 
personality. 

Several purely technical problems persist with interest inventories. 
Among these problems are (a) the use of unit versus multiple item weight- 
ing, (b) the optimal number of items to include in a scale, (c) the proper 
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choi i 
Choice of reference or normative groups, and (d) ways to overcome 
irrelevant response sets. 


Current Trends 


Newer Scales 


"bos omuia extension of the empirical keying method to skilled trade 
lerne was reported by Clark (1961). His Minnesota Vocational Interest 
tory (MVII) was the outcome of extensive research that began in the 


1940's with N à : 
S with Navy enlisted men. The current civilian form consists of 570 
choose the most and the 


ler Ln UO as within each triad, subjects se the me 
and oni. we qon. items are scored by a unit-weighting system, 
Clark dex el items are scored for each of 20 different occupations. 
about ite ene the research and reasoning which went into decisions 
predictiv m selection and item scoring. He also included data on the 
seque e validity of scores against choice of technical courses and sub- 
nt course grades. 

ie of the MVII for high school boys remains under study. There 

iculties with its vocabulary level and with the amount of time 


ngeded by some boys to answer the items. 
BOEN wi response set were considere 
roe Cee supplied a verification sc cete 3 
tained re y Clark's adult tradesmen. It clearly distinguis ^ scores ob. 
inia rom college males who took the M} II under a normal set an 
ben s set to randomize answers. The tendency to answer in terms ? x 
à within a triad rather than in terms of item content — to be 
it Sligible, Campbell and Sorenson (1963) had subjects retake the same 
‘tems located differently within the triad and found corresponding changes 


1n position preference. 

The relatively new, criterion-keyed Kuder Preference Record —Occu- 
Pational, Form D, also received attention. Factor analyses were aimed at 
providing a meaningful way to group the more than 40 occupational scales 
On a profile sheet and at determining whether the same profile sheet could 
Pe used for males and females. Although Schutz and Baker (1962) found 
quite similar factors for male and female freshmen at Arizona State Uni- 
versity, the occurrence of sex differences in means. standard deviations, 
and factorial complexity for given scales raised some doubts about m 
Preting scores in the same way. The factors themselves closely Wie 
factors summarized from previous research by Super and per E d 

ix of the seven factors were found in a near-replication by King an 

orrell (1964) with Michigan State University male freshmen. — i 

Hornaday and Kuder (1961) also considered the need for separate scale: 


i reri in 
or norms for the two sexes. Scores for 10 occupational scales were obtained 
ample and for females employed 


for the fathers and mothers of a student $ 
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d in two ways. Campbell and 
ale based on the infrequent 
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in similar occupations. In 9 of the 10 occupations, male keys differentiated 
women as well as men from the basic male reference group. The exception 
was the key for librarians. It might be that different job demands existed 
for male and female librarians in the sample studied. Hornaday and Kuder 
concluded that although the same profiles or norms could be used in most 
cases, research was still needed to determine possible sex differences for 
specific occupations. 


Additional Reference: Filbeck and Callis (1961). 
Revisions in the Strong Vocational Interest Blank 


Strong (1962, 1963) and Strong and others (1961) announced revisions 
in the men's form of the SVIB which had been necessitated by changes 
since 1938 in specific word meanings as well as in the nature of some 
occupations. The current form for men will contain 49 reworded items 
among the 298 which were retained from the old list of 400 items. Strong 
and others (1964) showed that as adequate a discrimination could be 
realized with fewer and also with unit-weighted items as could be obtained 
with variable weights and with many more items. 

A recently constructed inventory, which includes 101 entirely new items, 
will be used to construct additional scales. Several research-based decisions 
concerning new scales included (a) use of item-retest data to exclude un- 
stable items, (b) limitation of the number of scorable items for each scale 
to about 40, and (c) employment of unit rather than multiple-weight 
scoring. For example, when eight occupational keys were compared 
(Strong and others, 1964), no appreciable differences were found in (a) 
reliability, (b) accuracy of differentiation of an occupational group from 
a reference group, and (c) intercorrelations between scales based on unit- 
weighted and multiple-weighted methods of scoring. This move to unit 
weighting in item scoring contradicted Kuder’s (1963) argument that 
improved computer and scoring technology reduces the advantage of sim- 
plicity. Kuder proposed that item weights be the simple difference in 
percentage by which two different groups answer items in a given way- 
The possible range of weights from +100 to —100 actually reduces to a 
far smaller range in practice. Kuder's data based on this method showed a 
high degree of differentiation. Since Kuder did not compare his results 
with unit-weighted scores based on the same data, one cannot be sure that 
the gain in discrimination was worthwhile. 

The scheme proposed by Darley and Hagenah (1955) for judging 
intensity of interests in SVIB profiles was criticized by Stephenson 
(1961b) mostly for its subjectivity. The major result of the scheme has 
been the attainment of a lower degree of interjudge agreement than is 
desirable in profiles of ratings for the so-called primary, secondary, and 
reject patterns. In comparison with Darley and Hagenah’s system, Stephen- 
son proposed a somewhat more complex alternative scheme, which not only 
retained the essential features of the older system but also provided a 
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special char Cae Cy 
an 4 D 
be expected a ers proposed a simple method which should also 
Date bes Las rel € results. However, they did not directly com- 
Blanes dor Sta raro with other methods relative to the same data 
Merit hec: ord University students and for a sample of National 
Satter modd sd vee suggested that far fewer primary or reject 
Hagia qum for Korn and Parker’s method than for Darley and 
deriven. hs he x ys arbitrary agreement or objective standard can be 
| indie ings to evaluate these different methods. 

attack, "inicie sa : e d in profiles for the SVIB also came under 
within. which sco ys ie ed areas on the score sheets represented intervals 
ins anes. ‘ee res should be interpreted as not significantly different 
should epa es threw dice to define the average chance score one 
ini eme stand sanie each scale. Boundaries were set at plus and 
1962) data is ard deviation about these means. Stephenson's (1961a, 
Stottes d aie m n the traditional interpretation. So-called chance 
Vere Susie p Cie y stable on retest. Moreover, when college subjects 
Were adul to oit a p cenos scores they could not. However, when they 
earned under nion wid igher or lower scores on specific scales than they 
(1963) recom mal instructions, they could modify their scores. Campbell 
they would mended defining the shaded areas in such a manner that 
Strone’s “n represent insignificant deviations from the average scores for 
deviation Cabal iuc group. An interval of plus or minus one standard 
scores than we the vu for “men-in-general” gave a broader range of 
closer to. v] ore. This procedure surely furnishes a meaning which is 

hat counselors can use and understand than that previously 


Provided. 


Response Set 


es at face value. Among other possi- 


Fey : 
„tew psychologists take interest scor 
nse bias which may be deliberate or 


biliti 
uc b reflect a degree of respo 
scious. Kirchner (1961) compared the SVIB scores of 156 applicants 


fo p 
ot m industrial sales positions with the scores for 117 men em- 
with th n these two positions for a minimum of five years. When compared 
tigen Pie ieri the applicants scored higher on social service an 
Although etail scales and lower on sales. scientific, and ibd od 
strongly io differences between the samples were iuri pd 
safe wa pojaga a liking response set. This set couta, e the 
Bada or an applicant to respond in such a real life situation. - 
NI uc anim and Hollenbeck (1961) found that introductory psycho ogy 
s increased their scores on both the 


th Sanitary Supply Salesman and 
e Industrial Psychologist scales of the Kude 


r Preference Record—Occu- 
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pational, Form D, when these students were asked to imulate Mq eim 
for these positions or applicants for “a job in industr ; These authors 
reasoned that special predictor scales for the situation in which XA PW] 
instead of “men-in-general” are used as a reference group would be helpfu 
in industry. This conclusion was consistent with Kirchner's findings. 

The tendency to respond like (L), indifferent (1), or dislike (D) to an 
exceptional degree on the SVIB was studied by Armatas and Collister 
(1962). Their sample from the University of Kansas comprised three 
groups of males who scored one standard deviation above the mean on the 
L or I or D response count but below the mean on both of the remaining 
counts. These subgroups did not differ in academic aptitude or skills in 
English usage, reading. and mathematics. Scores on the Edwards Personal 
Preference Schedule (EPPS), the Sixteen Personality Factor Test, and 
the Inventory of Beliefs were used to test hypotheses concerning personality 
differences among these groups. Among the many /-tests and other com- 
parisons, the small number of significant differences which were in the 
predicted direction seemed sufficient to support the hypotheses. Generally. 
in comparison with individuals placing in the D and 7 categories of re- 
sponse, L respondents were more socially oriented. more flexible, and 
lower in academic achievement (grades). Relative to members falling in 
the / and L classifications, D respondents were more object oriented, more 
rigid, and lower in academic performance. In comparison with individuals 
in the L and D groups, J respondents were more intellectualistic, more 
aloof, and higher in academic achievement, especially in mathematics and 
science courses. 

In his use of different personality measures, Brown (1963) reported a 
partial replication with 220 female nursing students. He found no correla- 
tion above 0.21 between L or J or D counts on the women's form of the 
SVIB and scores on scales from the Minnesota Multiphasic Personality In- 
ventory, the Guilford-Zimmerman Temperament Schedule, and the Kuder 
Preference Record —V ocational. The SVIB profi 
be most heavily influenced by D responses, 
responses, and little affected by / responses. Brown concluded that his own 
findings as well as those of others showed that although there was a non- 
chance relationship between L or I or D modes of responding and meas- 


ures of basic personality traits, the relationships were 
practical use. 


les themselves seemed to 
somewhat modified by L 


too low for 


Construct Validation 


Determining whether inventories reflect intrinsic or extrinsic interests 
and values poses challenging problems of definition and methodology. In 
a novel design. Miller (1962) compared interest and personality traits for 
three occupational groups that differed in work setting. in job function, or 
in both characteristics. His subjects were 50 YMCA boy workers, 50 
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verit Maus wo puepe and 50 controllers. Interest and. personality 
inderdel m z from responses to selected items and to subscales of 
vite i y Significant differences were found for the sep- 
respect to ET nx o» not for age levels within these occupations. With 
Neh ni associated with job setting or function. the results 
value anientat occupations do vary in terms of intrinsic and extrinsic 
I ation. 

Inventory ds pr Age iota scores on the SVIB and Super's Work Values 
ee ed is , college students. The latter inventory was revised to 
pondent ale S, ir specific work concepts and to provide six inde- 
valie seale, roc n bjects who scored in the top and bottom quarter of each 
Strong ohio pompared for proportion of high scores on each of seven 

st groups. Relatively close agreement with expected interest 


patterns w ae th e 
was found for intrinsic values, such as heuristic-creative. In the 


mpeg €— value scales, such as work conditions and associates, 
Suziedeli: agreement was relatively low. iu 
iion: of As REIR Steimel (1963) investigated the possibility of a correla- 
golles dra cens of interests with those of basic personality needs. Male 
Pre leren = [xa and sophomores took the SVB and the Edwards Personal 
B+'s—on chedule (EPPS). The numbers of high scores—A’s and 
who dom ow of seven SI IB interest areas were compared for students 
ing high n nigh or low on each of the 15 need-scales of the EPPS. Stand- 
oni that os eunt that a student earned his highest or second highest score 
second E need-scale; standing low meant that he earned his lowest or 
owest score on that scale. The hypothesis that persons with given 


predomi s d E d f 
minant needs will show characteristic vocational interests was partly 
] common expectations, 


sup 
Ki Some of the need-interest patterns fitted ; 
«though other patterns did not. For example. students who scored high 


in s r 
the scales of achievement or endurance of the EPPS. in comparison with 
ly showed high standing in the 


t . 
eak r a not score high, more frequent ea aage aiae 
the ar biological and physical sciences. of the B än ] " = e| 
c ea of sales of the SVIB. However. students placing high on the 
i Seg scale of the EPPS, in comparison with those students not 
P acing high, more frequently scored high only in the literary or verbal 
ne area of occupational interest in the SVIB. 
wo 2 (1963) tested specific hypotheses concerning 
SV] d predominate for given vocational interest grou 
cg and a special battery of self-report measures of wor 
n males in introductory psychology- The latter set of tests wa 
“ctor analytic studies that had been aimed at providing clearly 
independent measures for material security, job freedom, personal st s 
Social service, behavior control. system. and structure. A correlation o 
sis produced both hits and misses although the over-all results cou e 


i si iance, Crites com- 
judged as satisfactory. Through use of analysis of Dua a men. 
Pared mean motivation scores des s 


at Miller pursued, Kinnane and 


the motives which 
ps. He gave the 
k motivation to 
s based on 
defined, 
atus, 


for subjects classified by 
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(primary, secondary, and reject) in each of the major Strong prp 
areas. Except for some interaction effects, the results suggested that kin 
but not degree of motivation was related to given fields of interest. R 
Crites's data concerning job freedom can be compared with the previ- 
ously cited data concerning the inferred need of autonomy in the study 
by Suziedelis and Steimel. For Crites, measures of job freedom correlated 
significantly (low positive) with average interest score in the biological 
and physical sciences and about zero with scores in the verbal expressive 
area. That different measures of the same motive construct failed to 
correlate with the same measures of interests brings to light the need for 
more adequate construct definitions and measures than are now available. 
Additional References: Springob (1963) ; Super (1962). 


Conclusion 


Perhaps the most important developments of this review period were 
(a) the culmination of Clark's (1961) work with skilled-trade interests 
and (b) revisions in the SVIB. Purely psychometric matters such as item 
selection and weighting were handled most expertly, and the problem of 
coping with irrelevant response was given careful consideration. 

Much remains to be done if one is to link interest scores to motivation, 
learning, or personality variables. Simply correlating current measures of 
interest with measures of what is inferred to be motivation or with other 
variables is an easy but often fruitless gesture. One can hardly expect to 
clarify the meaning of one construct by relating it to a second construct 
which is not adequately understood. More effort should be put into experi- 
ments in which learning or motivation processes are better controlled. 
Conditions to date still indicate that psychologists know much better how 


to apply the mechanics of psychometrics to the construction of interest tests 
than how to explain what they measure. 
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CHAPTER VI 


Trends in the Measurement of Personality 


EUGENE L. GAIER and WILLIAM F. WHITE* 


If " 
Sade ari > pate of personality was once largely confined to 
Bs irom tenner ires measurements from a limited number of tests 
fins Ree a oyed with little apparent regard for theoretical implica- 
for the ad D college sapligrmores, then the research to be reviewed 
probably rival. ger 1961 to July. 1964 revealed some interesting, 
worthy iem es albeit somewhat elusive changes. The following praise- 
aS bes E were noted: (a) appraisal of other investigators’ as well 
s own data when the latter were being interpreted, (b) a trend 
about the results of a given investigation 


tow : 
ard framing new hypotheses 
isting theories, and 


eens served to test or to elaborate upon ex 
statistically ics of new theoretical formulations. However, from the 
IPAE. n lec i duaiaite studies concerned with personality to the more 
the last a ic surveys encountered in symposia papers. the studies during 

ee years in toto reflected a picture of chaos, from which, how- 
ise amid an increasing sophistication 
generally acceptable, theories. 
the literature in terms of an 
able even to investiga- 
and stated 


of ee seems to be more prom 
kind ep nd tested, perhaps even more 
apparent degit has appeared to plague 
lors win ac ; of criteria that were generally acceptable 
tomite seemingly comparable training. experience, 

x ents. 
omen pesene, Hang 
s that were judged pertinent to pe 


Period cover : $ 
d covered. These groupings include method 
] desirability. 


five major groupings of 
asurement for the 
rsonality 


was organized around 
rsonality me 
and creativi and design. pe 

creativity, response set and socia self-concept and body 


ir 
nage, and anxiety. 


Method and Design 

rsonality have provided no 
development and refinement 
been the extended interest in 
and stimulated by the 
Another trend 


A T 
though empirical attempts to assess pe 


heh din, they have led to a continuing 
D. migues and instruments. One trend has 
ae and R-techniques that were facilitated 

spread use of computers and factor analytic methods. 


in he writers wish to express their appreciation to jip Rind Guenther soe Merete 
ing the bibliographic entries. 
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has been in the direction of the validation of long-established beet 
with respect to external criteria. Noteworthy attempts were made to rela 
these tests and instruments to constructs, models. and theories. — 

Hypothesis generation, based on factor analytic approaches as iig 
from hypothesis testing, was evident in the field of personality gne s tn 
This trend coincided with the expanding market of computers and wi 
the extensive listings of available statistical programs. Criticizing m 
research in personality involving factor analytic approaches. Smedslund 
(1964) argued that too little concern for the psychological processes 
relating to item response culminated in only a resultant intuitive "em = 
the interpretation of scores. A substantial lack of objective and. — 
operational connections also resulted from the questionable labeling o 
factors. .- i 

Earlier, Cattell (1962b) asserted that any proper conceptualization 8 
the relation of personality and role would be unattainable without one's 
grounding the discussion in general group theory. Accordingly, in a single 
theoretical model he posited 28 propositions defining a series of person- 
ality, role, and group concepts in group behavior which he believed to be 
mutually consistent. For Cattell, the concept of dimensions defining the 
parameters he envisioned were based on a factor analytic approach. al- 
though he was quick to add the qualification that this orientation did not 
constitute an indispensable restriction. Nevertheless, Cattell (1962a. b: 
1963a, b; 1964) continued both to extend and to amplify his approach— 
an approach that has been dominated by his singlemindedness in demon- 
strating meaningful personality parameters via factor analytic techniques 
(Cattell, Knapp, and Scheier, 1961; Cattell and Tsujioka, 1964). In spite 
of the confusion caused by his esoteric vocabulary and in spite of a 
tendency to overinterpret factors based on unreliable measures, Cattell 
initiated lively controversy in a field too often preoccupied with the 
obvious. 

On the other hand, Overall (1964) noted the lack of scientific status 
of factors and seriously questioned the relationship between factor analysis 
and primary dimensions of objects. In his illustrative example, neither 
the number nor the nature of primary factors had an inherent relationship 
to the true conceptual dimensions of the objects measured. From considera- 
tion of five analyses of objective personality data, Cattell, Knapp. and 
Scheier (1961) described second-order factors as contributing to the 
understanding of personality, although their predictive power was judged 
to be less than that of first-order factors. Cattell (1962a) not only defende 
the interpretation of rotated factors in oblique rather than in orthogonal 
form but also stressed the importance of the interpretation of factors a5 
influencing agents upon which the variables in the study are dependent. 
Thus, during the period from 1961 to 1961, increased attention was 
directed toward the purposes of factor analysis, the explorations of prob- 
lem areas, and the ways and means of interpreting factors. 
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Simulation of Personality 

The. advancement of computer technology has brought ever closer the 
potential that computer simulation may hold for personality theory. Per- 
haps there was no greater contribution of computer simulation to per- 
sonality research than that of the simultaneous handling of vast amounts 
i data ina multivariate manner. Although fear had been expressed that 
oe of machines might bind and shape the limitless capacities of 
; the optimism and heuristic value of the articles edited by Tomkins 
and Messick (1963) pointed out unique psychological contributions of 
PCR el tion studies on cognitive behavior by 
nad 33) and on models of perception and language by Uhr (1963) 
»ecome creditable vehicles for expressing personality theories. If 
e in tune with psychological theory, 
ant of the details of computer 

ystems. 


co sca e ; a A 
mputer simulation. The simula 


educators : 
Stilton and psychologists are to b 
it w > i 
$ vill be necessary for them to be cogniz 
rogr. 4 

Programming and to be prepared to program new $ 


Subjee 
Subjects and Experimenter 


: Perhaps because most of the researchers who publish in personality 
assessment and dynamics were academicians. freshmen and sophomores 
continued to be captive lots drawn for experimental study. Orne (1962) 
Suggested that subjects formed their own hypotheses regarding the purpose 
or experiments and then responded to variables in a manner concomitant 
with their private hypotheses. The biased research resulting from per- 
sonality characteristics of volunteers has usually limited the formation of 
valid generalizations to larger populations. [n 
. Since students in psychology classes are most often obliged to participate 
" varied research in order to gain course cred 
"gd of those who keep their appointed times 
nt. Levitt, Lubin, and Brady (1962) demonstrated. tha 
volunteers were contaminated by those who promised to 
failed to appear for the experiment. Male “no-shows” were reported by 
Leipold and James (1962) to be more and less in need of social 
approval than those who “showed”: female lower grade 
m eR. than female "shows 
. the hierarchy of birth order became 
lion of the validity of sample selection. Glass and others (1963) concluded 
that first-borns did not react so strongly to frustration as did later-borns. 
They suggested that birth order be considered an ecological as n as a 
Psychological variable in the interpretation of empirical findings. n con- 
trasting late-borns to first-borns. Stotland and Cottrell (1962) concluded 
that late-born youth react more con ly to experimental manipulation. 


Although the topic of experimenter bias and of the effect of the experi- 
tudes on the performance of subjects in 


menter's personality and atti 
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lit. the problem of bias in 


and tasks becomes impor- 
t studies using 
participate but 


anxious 
“no-shows” had a 


a debatable variable in the evalua- 


sistent 
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personality ratings is somewhat taboo, two reports highlighted the problem 
of experimenter determinants. Among the many conclusions draw n from 
his studies, Rosenthal (1963) reported a non-chance relationship between 
the extent and tenacity with which the experimenter viewed his hy parser 
and the response behavior produced by subjects in those tasks associated 
with that hypothesis and that specific experimenter. Early data Tétukns 
were also found to influence the experimenter and to produce experimental 
bias. Rosenthal suggested that the chief source of the bias-producing 
agents was pre-experimental instruction and behavior which invoked some 
form of response set. In their study of the conditioning of hostile verbs. 
Sarason and Campbell (1962) reported a significant positive interaction 
effect between the general hostility test scores of both experimenter and 
subject. 


Personality and Creativity 


A major by-product of the findings reported on differences between 
intelligence per se and creativity has been an intensified effort to isolate 
and to identify personality facets identified with creative behavior. 

Despite reservations expressed by de Mille and Merrifield (1962) re- 
garding the adequacy of its methodology, the work of Getzels and Jackson 
(1962) may well have been a major impetus for the marked upsurge in 
research concerning creativity. The literature abounded with studies in 
which attempts were made to pinpoint the creative function in terms of 
four orientations: (a) the nature and quality of the product created. 
(b) the actual expression of creative acts and the continuing intellectual 
process during the creation, (c) the nature of the creator, and (d) en- 
vironmental factors that fostered creativity. Although results of varying 


degrees of generalizability and validity were reported —and the debate on 
whether the creative process was really 


different from other problem- 
solving 


processes was extended and intensified—a singular lack of empiri- 
cal research grounded in personality theory, as such, was observed. 

In his study of National Merit Scholarship recipients, Holland (1961)- 
who used a measure of risk-taking tendencies, reported that creative per- 
formance among high school students occurred more frequently among 
those who were independent, expressive, 
inclined to have high aspirations for future achievement than among those 
who exhibited qualities essentially opposite to those enumerated. 

The work of Barron (1962, 1963) provided data on measures of per 
sonality and creativity. From a group of professional and student writers- 
Barron (1962) found distinguished writers to be particularly high on the 
scales of the Minnesota Multiphasic Personality Inventory (MMPI) 
associated with schizoid, depressive. hysterical, psychopathic, and femi- 
ninity predispositions. On the scales of a test developed around Jungian 
typology. the distinguished writers were distinctly more introverted than eX 
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troverted, more dependent on feeling than on thinking, and more intuitive 
than sense-experience oriented. As a group. these writers were unusually 
high in conceptual and verbal intelligence. What may be most significant for 
personality theory is that having an IQ above 120 appears to be but a 
From motivational and stylistic variables 
ativity—results in accord not 
arlier studies 


negligible factor in creativity. 

emerged the primary determinants of cre 
^ only with Torrance's (1961) latest rapprochement of his e 
but also with Burkhart’s (1962) findings that creative students in art 
scored high on measures of four dimensions of personality creativity: 
(a) spontaneous abstract orientation, (b) divergent power. (c) ideational 
and perceptual openness, and (d) social self-determination. 

Also relevant were the observations of Selye (1964), who delimited 
16 personality types among creative scientists, such as fact collector, 
gadgeteer, bookworm, and classifier. Although the reviewer does not wish 
to propose that making lists will provide all the answers, this array might 
provide a springboard which could even catapult creativity theory into 
not only a less dehumanized but also an appropriately more complex set 
of answers to many current questions. Tem 

Investigations of scales designed to assess personality and originality 
continued to point up that the validity of the scales of personality inven- 
tories such as the California Personality Inventory (CPI) and the Sixteen 
Personality Factor Questionnaire is generally low, even in the face of 
considerable variation among studies by Holland (1961) and Smith and 
others (1961). Validity caellicients of nonintellective scales intended to 
assess originality were also low, although a wide range of correlational 
values was reported. r 

In general, biographical items and measures of past achievement ap- 
peared to be the most valid and efficient predictors of creativity, followed 
by self-ratings and direct expressions of goals and aspirations. Originality 
and personality inventories ran what Taylor (1964) called “a poor third. 
Traditional aptitude and general intelligence measures ranked even below 
Personality inventories. 

Some of the most provocative data 
ures and approaches continued to emerge from repor 
F how j of Utah conferences on the identification © 

alent (Taylor and Barron. 1963). : 

, Perhaps. creativity is too complex and elusive to be probed — 

in laboratory isolation. What the current data from many studies 

Sugge: , tho i tive is simultaneously tough- 
ggest was that the person who is cred A imuli in a novel 

minded and flexible enough to cope with environmental stimu' 

Yet acceptable fashion. 


associated with new types of meas- 
ts made at the Uni- 


f creative scientific 


] Desirability 


ept of response set dur- 
a. 1962; McGee. 1962a, 
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b; Marlowe, 1962) continued to leave the major question in this area still 
unresolved: When is response set reflecting responses to the individual 
items rather than reactions to content? In fact. when the initial studies of 
response set appeared, investigators were quick to sound the tocsin herald- 
ing a revolution in personality measurement. Instead, what seems to have 
emerged is the sound of civil war among investigators. 


Social Desirability 


Interpreting high scores on a scale of social desirability to be indicative 
of the need for social approval, Marlowe and Crowne (1961) found that 
subjects who scored above the median compared to those below the median 
expressed significantly more favorable attitudes in response to questions 
about a tedious spool-packing task. Jackson and Pacine (1961) reported 
consistently low and generally nonsignificant coefficients. of correlation 
between sex membership and the rated desirability of Gough's CPI items. 
In comparison with males, females choose relatively more of the desirable 
items. 

In the factor analysis of a matrix composed of a particularly broad 
sampling of 61 personality scales—MMPI scales and three other per- 
sonality scales—Edwards, Diers, and Walker (1962) interpreted the first 
rotated factor as the tendency for one to give socially desirable responses 
and the second factor as the tendency for one to provide acquiescent 
responses. À third factor was tentatively identified as the inclination to 
falsify answers to personality items. A later factor study by Edwards and 
Walsh (1961) yielded six factors. with social desirability as the first factor. 

Utilizing 10 aggressive statements written in seven different styles. Gold- 
fried and McKenzie (1962) investigated the differential effect of item 
style on social desirability and on frequency of endorsement for males and 
females. Although both social desirability and frequency of endorsement 
were found to be influenced by item style, the influence differed for the 
two sexes in the case of frequency of endorsement but not in the case 9 
social desirability of ratings. 

Barthel and Crowne (1962) confirmed the hypothesis that when ap- 
proval-motivated subjects employed the social desirability categorization 
displaying the greatest taboo-neutral word discrepancy scores and when 
approval-oriented subjects adopted a perceptual speed set toward the 
experiment, minimal discrepancies were obtained. The investigators intel 
preted their data to imply that a major component of variance in pet 
sonality defense studies was contributed by the conscious inhibition 9 
responses. In a related study. Dicken (1963) employed measures of £004 
impression, social desirability, and acquiescence as suppressor variables 
with nine personality scales of the CPI. Nine corresponding behavior 
ratings were used as criteria in one or more of six independent samples- 
When variance in the measures of good impression and desirability was 
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accounted for, significant gains in validity were rare, and no gain in 
validity resulted from suppression of acquiescence. Existing methods of 
correcting for response set variance in personality scales did not appear 
to be practical. In Dicken’s estimation. the importance of social desirability 
and acquiescence in questionnaire personality assessment may have been 
vastly overemphasized. i 

. When an instrument titled Survey of Interpersonal Values was admin- 
| mie under normal instructions and later under faking instructions 
gal 1963), scores increased on measures of conformity and leader- 
Ship, but decreased on measures of support and independence. Employing 
a sample of 65 college students and 105 male psychiatric patients, Boe and 
Kogan (1961) utilized a revised check list of items concerned with inter- 
nd values and proceeded to relate scores on seven measures of social 
desirability obtained from the check list to several scales of the MMPI. 
Although scores on scales of social desirability that were derived from the 
check list served to point out fakers of social desirability among both 
college students and psychiatric patients. social desirability scores based 
only on the tendency to endorse socially approved items exhibited a low 
correlation with measures of social desirability on the MMPI. However. 
Various scores in social desirability based upon all the items in the check 


list were highly correlated. 

In addition to the studies that were 
theoretical base of personality assessment, 
specifically concerned with the search for mean 
social desirability and varied facets of personality (J 
1962: Mahler, 1962). 


aimed toward broadening the 
many investigations were 
ingful relationships among 
ackson and Messick. 


Acquiescence 

Langer (1962) addressed himself to the question of determining whether 
à meaningful relationship existed between response sets associated with 
acquiescence and social desirability dimensions as manifested both in 
Gough's Adjective Checklist and in the W and Dd responses on the 
Structured-Objective Rorschach Test (SORT). On the basis of responses 
to Gough's instrument, Langer divided his subjects into agree and disagree 
Stoups, which were further subdivided on the basis of sex. The SORT was 
Subsequently administered. A relationship between acquiescence and social 
desirability response sets was partially confirmed. in view of the fact that 
the male disagree group exhibited a significant rank order comi aum 
between the disagree dimension and Dd responses, whereas the Er 
disagree group yielded a significant rho value between disagree set an 
responses. The commonelity underlying the set phenomena was s aee 
Sized in part to be attributable to the interaction of cognitive style with 


lest variables. 
Of particular relevance to the matte 
bility were Wiggins’ (1962. 1963) attem 


r of acquiescence and ‘or social desira- 


pts to differentiate among strategic. 
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method, and stylistic sources of variance in the MMPI. For Wiggins. 
strategic variance occurred when scales were used to discriminate between 
criterion groups and a normative population. Method variance. which was 
obtained through use of keying categories such as true and false, was also 
affected by the relative proportion of items in each category as well as 
by the relative item popularity. Stylistic variance referred to subjects 
characteristic response styles, such as acquiescence-defensiveness—styles 
which are relatively independent of the intended measurable function 
underlying the test itself. In a factor analysis of seven social desirability 
scales and six acquiescence scales, the social desirability scales clustered 
in one quadrant, whereas the six measures of acquiescence were found 
in another. 


Attitude Scales 


In his attempt to answer the question concerning what attitude scales 
measure, Taylor (1961b) reported average interquartile ranges of item 
ratings for social desirability to be about three scale positions out of nine. 
Data were presented to show that correlations between scores on attitude 
scales and responses to items of a personality scale which were used as a 
basis for reporting a general response tendency were not explainable in 
terms of reference to a general acquiescence set. His findings on “yea- 
sayers" were also relevant (Taylor, 1961a). Similarly, to identify the 
concept of social desirability, Bell (1962) suggested that the elevation 
component of an individual's personality inventory profile might repre- 


sent acquiescence or social desirability, whereas scatter might simply 
reflect deviant response set. 


Personal Needs 


Garrison and Scott (1962) noted th 
tive women teachers varied signific 
characteristics. They re 
needs from the Edwar 


at specific personal needs of prospec- 
antly according to selected persona 
ported that 14 of 15 measurable psychological 
ds Personal Preference Schedule (EPPS) varied 
according to one or more personal characteristics of the subject, including 


marital status, age, college class standing. number of siblings, order of 


birth in family, educational level of mother and father, and ceoupation of 
father. 


Methodology 


In his investigation with subjects in Western Australia who were in- 
structed to answer questions on the MMPI as though the responses reflected 
their “ideal selves” as significantly more extroverted, Star (1962) found 
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bee = subjects had scores which suggested that they were less neurotic 
eris e te than subjects who took the instruments under regular 
NV. : S a concluded not only that the MMPI scales were highly 
Eine: i ciu ee set but also that the direction and extent of dis- 
ceed es = rom one population to another. Orne (1962) firmly 
er amet in er that the subject must be recognized as an active 
ea a H an eapeniment and that it may be quite fruitful to view the 
em — y emn as a very special form of social interaction. He 
fotos on the subject's behavior in the experimental setting is a 
L ib he total situation encompassing experimental variables. 
a peii o six kinds of deviant response tendencies, Sechrest 
sen ag ( 963) took the position that deviant responses cannot be 
one specific way: the investigator must take into account the 


variety YAN s s . " 
ety of ways people respond in a digressing fashion. 


Self-Concept and Body Image 


antify the individual's concept of the 


In the attempt to isolate and to qu 
d to augment the number of 


c he M of investigators appeare i f 
E ogies employed. Wylie (1961) presented a comprehensive review 
dide uments used prior to 1961 to assess the self. In this critical survey 
tion i research, Wylie concluded that four key flaws had plagued explora- 
lacks this field: (a) the existing theories were vague. incomplete. and 
Shanti in extensive empirical support; (b) the total accumulation of sub- 
antive studies was inadequate, inconclusive; and singularly disappointing, 
especially in view of the amount of effort expended in investigation of the 
self; (c) the existing theories—such as they were—were unable to account 
for behaviors observed, particularly in the context of the school of the 
everyday world; and (d) a most immediate need existed for the intro- 
duction of multivariate studies that would allow investigators to resolve 
Systematically instead of haphazardly many contradictions associated with 
the possible interaction of variables. Three years later, these same criti- 
cisms remain valid and unanswered. One can only hope that the serious 
investigator in this area will review Wylie's work and wil 


] undertake 
appropriate studies. 


Self-Concept 
f continued to suffer because of their lack of 


was raised concerning the nature of the 
and Courson (1963) boldly asserted that 


confusion exists in the research literature on the self-concept because 


researchers accept the self-report. The typical studies were based on various 
forms of self-report inventories. Moreover. with semisophisticated subjects. 


considerable room existed for malingering in the instruments used. These 
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While studies on the sel 
generalizability, a basic question 
study of the self. Combs, Soper. 
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authors reported low correlations between scores on self-reports and MEE 
of sixth grade children by teachers who had observed the children in Li 
situations. Perhaps. as an anticlimax. these investigators were cautious a 
conclude that although the self-concept and the self-report are related. the 
interchangeable use of the two terms is unjustified. M ] à; 

Relevant to these findings is the investigation of Field. Kehas. anc 
Tiedeman (1963) of the self-concept in career development. They theorized 
that translating the self-concept into vocational terms is something that 
occurs neither on one occasion nor with any degree of finality. Rather. it 
is a developmental process with respect to which the individual may fluc- 
tuate in his choices because of maturational or situational changes. In fact. 
they were severely critical of the Q-sort techniques’ (Block. 1961) failure 
to take into account the need for the person to see himself in a given step 
of a continuing process: in this method of self-rating. the person sees 
himself merely as a product. 

Mueller (1963) reported that when he acted as judge in rating others. 
the insight of the judges into himself was related to success in establishing 
insight into other subjects. The ability to predict and the ability to describe 
accurately were related, although it was less difficult for his subjects to 
predict than to describe. This finding might suggest that students who can 
set and accept their own self-concepts in contrast to those who cannot may 


be more successful in seeing how others react and how others view 
themselves. 


Body Image 


The concept of the body as a protective enclosure or container for the 
ego has stimulated much empirical investigation sinc 
formulation introduced by Fisher and Clevel 
that since hody references were especi 
subjects and since the obtained 
unique responses on the Rorsch 
related to the m 


e the initial theoretical 
and (1958). They speculated 
ally noted in the fantasies of their 
protocols contained unusual numbers of 
ach test. these responses were in some Way 
anner in which the subjects perceived their bodies. 
In the face of numerous contradictory studies questioning the Fisher- 
Cleveland scoring scheme, Daston and McConnell (1962) concluded that 
the system was sufficiently objective and stable under the conditions 
employed by them to be of value in assessing. effects of various treatment 
conditions of body image. 
Since 1958 a number of measures have been em 
Completion Test, the Rorschach barrier score. and the Thematic Apper- 
ception Test (TAT). The Rorschach penetration scores continued to be the 
most popularly employed measures of the body 
During the past three years. the role of body 
of personality has received creditable attention. 


ployed: The Drawing- 


image indexes. 
image in the organization 
In a comparison of somatic 
and psychological assessments among hospitalized. and nonhospitalized 
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subjects, Jaska Z 

i arr aks Reed (1963) did not find a significant correlation 

im Dis oe projective measures of body image. Moreover, it was 

oa "os "a dba nonhospitalized subjects did express greater 

ergy cae bia y parts and body functions on a verbal self- 

osos anil Cn an ba hospitalized subjects. In a general conclusion, 

n bodies ee he a T" (1963) noted that amputee children represented their 
redeem ed o le realistically and nondefensively in a Draw-a- 

cluded a anlar a ren either omitted amputated bodily parts or in- 

Pub m i E device in the projection of their own images. 

^ Elena: $ ationship between body image and small-group behavior, 
and Morton (1962) administered ink blot tests and sociometric 

eparate human relations laboratories. 
those considered to be well integrated 
more votes for being influential, 


ae to subjects at three s 
i ie pied score participants (ie, 
cic d en J did receive significantly ; 
jur BE dig m e goal striving than did those subjects with low 
The pars Mar. nie iduals judged to have a diffuse body image). 
tive of pate self me scores of the latter group were viewed as indica- 
Coie is e; selfish interests. 
entrating on children, Rowe 


that 7 à 
E a percent of a sample of Negro 

e! o £ " : S H 
ihat fh 5.6 years underestimated their body size. The authors suggested 
ere may be a “time lag” between the body image and phases of 


S : 

Fermi ie Roi In an investigation concerned with white male chil- 

60 DEAE in age from 7 to 14. Adams and Caldwell (1963) found that 

rbi ge ^s their normal sample and 53 percent of their disturbed group 
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varied anxiety studies through his categorizing past experimental trials of 
anxious subjects under headings of related experimental variables. 


Situational Variables 


Alexander and Husek (1962) reported research efforts that were focused 
on the development and cross validation of a measuring instrument for 
situational anxiety. Comparing anxious and nonanxious subjects on SIX 
exploratory anxiety scales, they presented partial evidence for the validity 
of the scales. The reliability of the scales was indicated from the results 
of cross validating and refining the scales of the Anxiety Differential test. 
Husek and Alexander (1963) also concluded that the Anxiety Differential 
test was reliable and sensitive. Data from experiments by Harleston | 1962) 
revealed that anxiety level depends for its effect on task and situational 
factors as well as on the sex of the subject. 

Two empirical studies concerned with anxiety in the school setting were 
noted. Scarborough, Hindsman, and Hanna (1961) concluded that anxiety 
made a difference in children's reading and language achievement when 
both intelligence and sex were controlled. Pringle and Cox (1963) admin- 
istered Sarason's Test Anxiety Scale and General Anxiety Scale and 
reported, contrary to traditional thinking, that children in ungraded, child- 
centered, progressive schools received higher mean scores on both anxiety 
scales than did children in graded, adult-directed, traditional schools. 

Winkel and Sarason (1961) attempted to assess the extent to which the 
experimenter variable actually became a confounding experimental factor- 
In a verbal learning situation with a strong experimenter-subject inter- 
personal component, three organismic variables—test anxiety of subjects: 
experimenters, and sex of subject—were manipulated. High test-anxious 


subjects did perform more adequately at lower levels of interpersonal 
involvement than did low test-anxious subjects; 


. females performed bes! 
when run by low test-anxious experimenters. 


Need Achievement 


Following the initial work of McClelland (1953) and Atkinson (1958) 
on need achievement, several investigators dealt with need achievement in 
relationship to a host of other behaviors, such as conformity, risk takings 
problem solving, vocal sounds, and scholastic performance. Because of the 
many variables believed to be related to the achievement motive, Moss 
and Kagan (1961) emphatically stated that since the term need achiev 
ment was too broad, greater usefulness might be obtained by replacing 
this concept with variables related to specific behaviors. 

Although systematic investigation on achievement motivation has been 
limited, disjointed, and scattered, the studies reviewed by Crandall (1963) 
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contained moderate but statistically significant consistencies across situa- 
tions as well as over the maturing years of children. 

Questioning whether the achievement motive exerts any influence on 
response learning under conditions of variable reinforcement, Burdick 
(1964) concluded that the schedule of reinforced responses and the need 
for achievement as measured did affect scores in a simple discrimination 

learning task. The arousal had the greatest impact on subjects with a 
low level of achievement. 

Applying a survey method in the analysis of the high need for achieve- 
ment with a northern Negro urban sample, Nuttall (1964) obtained posi- 
tive correlations between a measure of need achievement and authoritarian 
child rearing attitudes. Northern Negro men, high on a measure of need 
achievement, appeared to reject religious feelings, to feel victimized by 
racial discrimination, and to be racially more aggressive than Negroes low 
on the same measure. Negro women reared in the North and standing 
high on a measure of need achievement tended to join organizations such 
as the National Association for the Advancement of Colored People 
(NAACP) and the Congress of Racial Equality (CORE) more frequently 
than did women placing low on the same measure. 

Studies attempting to relate need achievement to academic performance 
have shown contradictory results, as exemplified by the investigations of 
Cole and others (1962), Rosen (1956). and Veroff and others (1960). 
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lack of ego strength. Rather, he interpreted it as an expression of psy- 
chological health. 

Four other studies concerned with the relationship of anxiety measures 
to personality variables were noteworthy. Meyers and Hohle (1962) failed 
to find significance in the relation of anxiety indexes to measures of con- 
formity and confidence among unselected college students. From a sample 
of mildly retarded male youth, Matthews and Levy (1961) concluded that 
the scores of retardates on the Children's Form of the Manifest Anxiety 
Scale (CMAS) were strongly influenced by response sets as well as by 
situational and capacity variables. The CMAS was considered inappro- 

priate for a retarded, institutionalized population. Mogar (1962) failed to 
find significance between 12 presumed indicators of anxiety on the Draw- 
a-Person test and an external criterion represented by scores on a scale of 
manifest anxiety for a sample of psychiatric patients. Replication of a 
study in which both size of figure and its placement were correlated with 
scores on a scale of manifest anxiety among females also failed to yield 
significance. Having administered a forced choice scale of anxiety and 

social desirability items to undergraduates, Bergs and Martin (1961) 

found nonsignificant but positive correlations ranging from 0.10 to 0.12 
between low scores on social desirability and an anxiety index. 


Outlook 


In many of the investigations of personality measurement which have 
been reviewed, recommendations were made for completion of additional 
research. The reviewers suggest that it is not so much additional research 
as integrative research that is now needed. Indeed, it seems time to talk 
less of statistical significance per se and more of the relationships of 
statistically significant findings to (a) assumptions underlying the research. 
(b) theoretical implications of the results, and (c) their consistency with 
those of other investigators. Too often. in the materials that have been 
reviewed, logical significance was lost in statistical significance. An almost 
frantic quality pervaded sometimes prematurely reported or not quite 
concluded studies. 

The widespread use of correlational designs continued. Considerable 
emphasis was placed on establishing relationships between two response 
variables. Such variables only approximated measurements of personality 
parameters which conceivably were outside a research setting. 

Computers have become a mixed blessing. Although their use in appro- 
priately designed research has obvious advantages. their disadvantages. 
albeit not intrinsic. emerge in the hands of certain investigators. For some 
workers, the availability of a computer seems to heighten to almost 
orgiastic proportions the impulse to design research that is oriented toward 
the minute statistical analysis of data at the expense of substantive and 
theoretical considerations. Perhaps this sort of reaction is to be expected, 
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to some extent, as statisticians and computer specialists continue to develo 
^ and to refine a relatively new technology. i 
eee methods in education, two questions may be 
abiak «x a simp mines id | of statistics or the cooperation with 
Ri een » € not ed ucational researchers orient themselves to an 
iber iie : gon ed containing meaningful, preliminary questions that 
y to answer after data have been collected and statistically analyzed? 
Although room must always be made for the devising and checking of new 
Personality tests. should so many investigators be giving such a hbieb 
Proportion of their effort to this objective? Perhaps, in spirit, psychology 
. too should be more imaginative. Otherwise, like pop art, it might well 
deserve the comment which occurs to one in that context: How many 
cans of Campbell's soup should one paint? 

The singular lack of attention to personality dynamics was a substantial 
deficit in the research reviewed. Perhaps psychological investigators still 
operate within a professional lag which makes them fear that, unless they 
restrict themselves to quite superficial matters or otherwise useful be- 
havioral terminology, they will be accused of the black magic once asso- 
ciated with the early “dynamic” psychologists. And yet, psychologists 
surely have the obligation, far short of witchcraft, to carry scientific 
explanations, or at least descriptions and speculations, into levels deeper 
than those found in behavioral terminology. . 2. 

With respect to personality research in relation to education, it is 
disappointing that so much information that has been enthusiastically and 
expensively gathered has resulted in relatively little that is interpreted, or 
interpretable, in a practical manner. To some extent, substantial gains 
must await progress in personality theory. Nevertheless, during y pur 
ning stages in research some cognizance of people as people woule SUREN 
bring forth products which more often have a perceptible connection with 
the classroom or even with the extra-school world than those which most 


current research furnishes. 
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CHAPTER VII 


Statistical Trends Relevant to Measurement 


SAMUEL T. MAYO 


The methodological pattern which Michael set in the Review ws 
of February 1956 and February 1959 and which Schrader —À : 
the February 1962 issue has been followed in this chapter. No at vua 
was made to review factor analysis, computer applications. or Emu 
physics, since these topics were covered, respectively, in i spun : Schiff 
of the December 1963 Review by Merrifield and Cliff. Baker, and 5c al 
man and Messick. In addition, Tucker’s (1963) chapter in the = 
Review of Psychology was concerned in part with these same three her 
Relevant to the content of this chapter was the publication of Ghise x 
(1961) concise textbook, Theory of Psychological Measurement, whic 
was devoted to a few selected topics in advanced measurement theory. 
Since the volume appeared too recently for reviews to be available, one 
cannot judge its possible impact on the profession. It would seem, p 
theless, to be an up-to-date supplement to Gulliksen's (1050) Theory 9, 
Mental Tests, although it appears to be somewhat less technical cenis 
Gulliksen's contribution. There have been altogether too few new books pub- 
lished in advanced measurement which have elucidated the highly suli 
and sophisticated statistical developments of recent years in psychologica 
measurement, : 
Significant references found during the period reviewed were primarily 
concerned with the following emphases: (a) critiques of previous tech- 
nology, (b) refinements in existing technology, (c) expositions of newer 
technology, or (d) exhortations toward needed research. Previous work 
in configural Scoring, linkage analysis, scalogram analysis, item discrim- 
ination indexes, moderator variables, and equating of test scores, to men- 


tion a few topies, was continued on both theoretical and empirical bases. 


Item Analysis 


During the past three years there has been not only the development " 
some new indexes of item characteristics but also the preparation © 
critical and expository papers on existing techniques. 


Psychological Considerations 


Recent criticisms have been made regarding the overemphasis upon -— 
development relative to the use of empirical criteria at the exclusion 0 
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consideration of psychological theory as a basis for the explanation of 
the empirical relationships. One recent example of such criticism was that 
by Smedslund (1964, p. 252). Campbell (1961) criticized the psycho- 
logically unsatisfactory nature of the percentage of individuals passing 
an item as an index of its difficulty. Campbell felt that it was necessary 
to know not only how many subjects pass an item but also why they pass. 
There was the rather pronounced implication that the aspects of complex- 
ity, abstractness, and novelty of item content are able to markedly affect 
item difficulty. This type of difficulty is called intrinsic. Although test con- 
structors should have reacted to the challenge afforded by Campbell's 
argument, they have neither devoted systematic attention to it nor given 
a precise statement of the manner in which the three aspects might be 


manipulated in order to change item difficulty. 
In his empirical study, Campbell used for his test content geometrical 


figures, which were to be classified. Perhaps the main result of the study 
was the finding that the crucial determinant of the intrinsic difficulty was 
the qualitative nature of the classifying property. The author felt that al- 
though some light has been shed upon the nature of the psychological 
determinants of item difficulty, more questions had been raised than had 
been answered. Thus, the field remains open for extensive future 


investigation. 

Myers (1962) explored the relationship between item difficulty and 
both the validity and reliability of a test. By way of introduction, Myers 
pointed out what appears to have been a cultural lag in the practice of 
test construction. In spite of general agreement among test theorists that, 
for maximum reliability, tests should be homogeneous in item difficulty. 
those who produce standardized tests continue to believe that items 
selected for a test should represent a wide range in difficulty. There seemed 
to be several rationalizations for this contradiction between theory and 
practice. Whereas the theorists are concerned with difficulty primarily in a 
statistical sense—that is, with the percentage of individuals in a sample 
who answer an item correctly—the test constructors are concerned with 
difficulty in a psychological sense. as it affects the morale or behavior of 


the test taker. Although it seems reasonable to assume that these two 
elated, it is not easy to determine 


concepts of difficulty are imperfectly corr 
what makes an item difficult. One can understand the reluctance to change 
a practice that has been considered adeq 
Myers’ study was addressed to the pro 
situation a selected set of items of average difficulty w à 
and more valid than would be a selected set of items half of which were 
considerably more difficult and half of which were considerably easier 
than the average level of difficulty. He failed to find any difference in 
validity between tests with peaked and U-shaped distributions of item 
difficulty, although the tests with peaked distributions tended to be more 
reliable. Myers further concluded that more needs to be known about what 


makes an item difficult. 
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Hawes and Helmstadter (1963) set forth another kind of criticism of 
the traditional measurement of item difficulty, in terms of the percentage 
of examinees in a group answering correctly. These authors felt that the 
major limitation to the use of such a percentage is that it does not reveal 
psychological differences among individuals which exist relative to the 
determination of the degree of difficulty for each single item. It is possible 
to have items with identical indexes of difficulty which psychologically 
are not of the same difficulty. In spite of this limitation, no one seems to 
have suggested any alternative ways of determining item difficulty. Hawes 
and Helmstadter attempted to devise a new measure of item difficulty in 
which the response of each individual to each item was considered. Results 
indicated that the method of counting the number of responses that were 
required to obtain the correct answer did provide a useful index for 
obtaining the difficulty for an individual. However, for practical purposes 
in test development, it would make little difference whether the new or 
traditional index were used. 


Item Selection 


Bridgman (1961) investigated the relation. between bivariate normal 
correlation and the upper-lower discrimination index (D) for various 
combinations of criterion group sizes and item difficulties. He found that, 
for any given combination within the parameter ranges studied, the rela- 
tion between D and r is very nearly linear. For 27-percent criterion groups 
and for 50-percent level of item difficulty, D is nearly equal to r. The 
slope of the line for the regression of D on r decreases as level of item 
difficulty decreases or increases. Applicability of these relations to actual 
item analysis data is limited by the assumption of a bivariate normal 
distribution. 
" Lingoes (1963) attempted to meet some of the criticisms which have 

een directed against the scaling techniques of Guttman, among which have 
been (a) lack of a clear meaning for his important concept of a universe 
of content and (b) unsatisfactory measurement of reproducibility. Lingoes 
maintained that his is a completely objective and empirical procedure for 
selecting dichotomous items which meet the Guttman scaling criteria. In 
his conclusion, Lingoes indicated that his method. which is moe objective 
than is scalogram analysis, can be used wherever the latter is applicable. 

Wohlwill (1963) explored the potentialities not only of applying the 
noncumulative, or point, model to psychological data but also of develop- 
ing satisfactory measures of scalability for noncumulative items. In the 
model proposed, the subject responds to only those items lying at or close 
to his own position on the scale. Although this model SEETHE to have a 
great potential for measurement in any preferential choice situation (such 
as those involving attitudes, interests, aesthetic judgments, food preferences: 
and some in developmental psychology), the problem of the measurement 
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of scalability remains to be solv extensi 

for the cumulative model om ee E p developed 
forward as it might appear. asina straight 
Hd A e ee the question of how the interpretation of test 
eres " ie by am units of measurement which represent different 
is Alheush cia ne ia measured at different points on the score scale. 
Baker pu > gem ity of units raises some difficult statistical questions, 
cds ae consideration of them in order to concentrate upon 
one Ws or the test constructor and test consumer. The consumer has 
: adit eh Eh gn to haim (al lo treat naively all test scores as a nice interval 
Baker felt 1 E. ry to take into account the variations in test score units. 
et om jat ! he fundamental concept in item analysis is the item charac- 
: Taria E e, i sooth curve fitted to the proportion of persons at each 
cs neni who make the particular response being studied. The 
v weder ae curve does provide much more information on the 
validity 1 us 2 an item than does either the difficulty index or the item 
imis d A urthermore, it provides the means to visualize easily the 
Alt es of an item and to specify them in precise mathematical terms. 
nough the alternative approach described offers no quick solution to 
the consumer, it was felt that the application of the approach might 

eventually help to eliminate the unequal interval problem. 
Ray, Hundleby, and Goldstein (1962) discussed the relation of item 


D 
parameters to test skewness and kurtosis. Both were shown to depend upon 
ariances. The trivariance and quadrivari- 


ation coefficients were also considered 
a were also given. 


i H 

tem means, variances, and cov 
a 

ange types of product-moment correl 
as ice iri 

is indicators of test parameters. Empirical dat 


Correction of Item-Total Score Coefficients for 


` Relevant Item Inclusion 


lations for relevant item inclu- 
developed indexes con- 
able. each of the present 
ndexes. Howard and 
on is probably 


al biserial corre 


The correction of item-tot 
previously 


sion was treated in two articles. Since 

cerned with the correction appeared to be question 

; articles represented an attempt to devise satisfactory ir 
Forehand (1962) pointed out that although the correcti 

not necessary for deciding between items involved in the process of 

measurement on a single scale. uncorrected formulas are not appli able 

to the problem of discriminant item analyses. In the latter case, in which 

a test is constructed to measure two or more traits, one could compare 

" the correlations of items with the total score of the scale in which they are 

included and with the total scores of the other scales. Such comparisons 

are difficult because of the spurious inflation resulting from inclusion of 

the relevant item. Two formulas for the corrected coefficient were given. 

One was called an exact formula: the other. an approximation to the 


exact formula. The approximation is a lower bound for the corrected 
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coefficient under certain specified conditions. Howard and Forehand sug- 
gested that the approximate formula could serve as a short cut. 
Henrysson (1963) developed a corrected formula. with the same numera- 
tor as in certain well-known existing formulas but a different denominator. 
The denominators represent different estimates of the standard deviation 
of the test containing the n—1 items that remain when any one item is 
excluded. However, Henrysson’s biserial item-remainder coefficient still 
suffers from the difficulty that the biserial coefficients are unduly influenced 
by unique parts of the item variances. He developed a second formula, the 
biserial correlation coefficient corrected for unique variance. For an em- 
pirical demonstration which was undertaken for a comparison of the 
various coefficients, Henrysson obtained values by the new item-remainder 
formula that were identical to those calculated from the raw data. The 
relevant item was removed each time so that the total score could be 
obtained. Values of the coefficients corrected for the influence of unique 


item variances are between those of the noncorrected coefficients and the 
item-remainder coefficients. 


Special Approaches 


Gruber and Weitman (1962) considered a problem for the theory of 
item analysis in longitudinal experimental studies. In this context it is 
desirable to construct tests that will be maximally sensitive to change in 
ability. For example, if the ability of the population changes, items having 
maximal discriminative power in a pretest will be reduced in discrimina- 
tive power in a posttest. Results suggested that item selection can be 
improved when account is taken of the direction of change in ability 
within the population tested. li 

McQuitty (1961) described a method for applying a usual type of item 
analysis to the selection of configurally significant items, He further 
showed how these items could be treated in either a summative fashion 
or a pattern score so that differentiation between groups could be realized. 

Additional References: Michael and others (1962) : Tupes (1963). 


Reliability 


Myers (1961) described a relationship betw 
matrix of test items and the concept of the sta 
standard length had been defined as the set of N items needed for a 
reliability of 0.50. Myers also described an index of item-test relationship 
in standard length terms and showed that the sum of these indexes for the 
items in a test is equal to the square of Jackson's coefficient of sensitivity- 

Cronbach and Azuma (1962) empirically compared several internal- 
consistency coefficients in terms of how well they serve various purposes. 
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The study was a by-product of previous attempts to reformulate the 
foundations of reliability theory and to offer an improved interpretation 
of coefficients. Tests were derived from a pool of dichotomous items, the 
content of which was not described although the statistical characteristics 
were stated. The results suggested that. for hypothetically random parallel 
tests, certain of the coefficients are more appropriate than others under 
variously specified conditions of mean item intercorrelation. 

In his use of a relatively homogeneous scale of 48 items intended to 


measure academic self-concept. Payne (1963) compared five frequently 
es of reliability with respect to six 


employed internal-consistency estimat 
Variations in skewness of criterion score distributions for different samples. 
Results indicated a high degree of similarity among the several estimates 
both within and across samples. The author further concluded that it made 
little difference which estimate of internal consistency was used. 

Webster (1962) devised a reliability coefficient of the ordering that is 
achieved by test scores. The coeflicient. which is simple to calculate, is 
practically insensitive to skewness. Furthermore, as skewness in total test 
scores decreases. the coeflicient closely approaches in value, depending 
upon the item scoring convention employed, the estimate furnished by 
Kuder-Richardson Formula 21 or a previously generalized estimate given 
for Kuder-Richardson Formula 21. It seems to be applicable for certain 
item selection procedures for which al tests and numerous poten- 


tial test items are available. 


long initi 


Norms 


Ebel (1961) compared three methods of 
8 (a) adjusted linear equating, (b) equiper- 


centile equating, and (c) adjusted equipercentile equating methods. The 
tests used were two forms made up from mathematics items. The methods 
were found to be almost equ On the basis of experience m 


this study, it was planned to develop d reliable empirical 
comparison of various methods. m 
per (1962b) pode a solution to the practical problem of m 
representative national norms for school achievement tests Madre ap 
pling in order that improvemen ; ative efforts mm iden 
the norming operation might be realized. Specifically. he raise pa an 
tion concerning whether the performance of a large group on 4 2 E i 
could be estimated through the administration of only a few items to each 


i i , r to sa 
student. The results suggested that in some cases it may be better to sample 
$ -as limited in that items were sampled 


items than examinees. This study W ; 4 
without replacement. The plan could be improved if samples of items 
additional item 


wer e 1 zhile same time 
vere taken without replacement, while at the s A d 
awn SO that eac "era erent 
aw acl n several € ff 


a item would appear ! 
;roblem of the optimum number of items and 
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examinees to use for a particular norming job could not be solved in this 
study. Further research will be necessary in which evaluation can be made 
of the various possibilities of trading items for people in a wide variety 
of situations. 

A symposium chaired by Wesman (1962) featured papers on three 
types of standard scores for aptitude and achievement tests: (a) normative 
standard scores described by Gardner (1962). (b) content standard test 
scores detailed by Ebel (1962). and (c! scales with nonmeaningful origins 
and units of measurement reported by Angoff (1962). Flanagan's (1902) 
evaluative discussion served to synthesize much of the material presented 
and to relate the importance of content scores to the development. of 
norms for achievement tests in Project Talent. 


Prediction and Classification 


Hazewinkel (1962) showed how the square root method can be used 
in the derivation of formulas in multiple prediction and in the computa- 
tions necessary for their application. Hazewinkel derived a formula which 
relates the length of one test in a battery to the multiple correlation of the 
battery as a whole. A numerical example was also presented. i 

Berdie (1961) explored the effectiveness of moderator variables in 
improving predictions. His central problem was to find a basis for sep- 
arating his subjects into relevant subgroups in order to maximize efficiency 
of prediction for each subgroup. The general hypothesis tested was that 
the variability of an individual's behavior on a test provides a basis for 
selecting such subsamples. A more specific hypothesis tested was that 
prediction will be less effective for persons whose test behavior is more 
variable than for persons whose test behavior is less variable. Evidence 
was found to support the latter hypothesis. Reasons for these results were 
not forthcoming from this study. Only a more sophisticated analysis was 
thought to be the key to a revelation of the relationships between predictor 
and criterion variables for different subgroups. 

Ghiselli (1963) reviewed the literature on moderator variables, de- 
scribed several kinds of applications for them. and reported some of his 
own empirical research on their effect on reliability and validity. He con- 
cluded that although moderator variables ) 5 


appear to offer promise of great 
utility in improving validity and reliability, the time and effort devoted to 
developing and using them might be spent more profitably in seeking 
improvements along classical psychometric lines. ! 

Chandler (1961) examined the conditions under which a variable with 
positive validity can function as a suppressor variable. He described a 
criterion for identifying such cases and presented two formulas for deter- 
mining whether a suppr ts. The formulas are relevant in a three- 
variable prediction equation when all the elements in the intercorrelation 
matrix are positive. The formulas show that (a) the suppressor effect 
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disappears as the validities of the predictors become equal in magnitude 
and (b) the suppressor effect is most likely to occur as the absolute 
difference between the validities of the predictors increases. Furthermore, 
when the validity of a possible suppressor is near zero, a suppressor effect 
will prevail if the intercorrelation between the predictors exceeds twice the 
ratio of their validities. 
1* Karon (1964) proposed a distinctive treatment of the commonly found 
age variable which is important to psychological and sociological research. 
Other variables are rarely related to age in a simple linear fashion except 
over a small range. The usual solution is arbitrarily to divide age into 
4* class intervals and to treat the intervals as a set of discrete categories. 
There are several objections to this procedure. When one considers the 
psychological nature of age. it would seem reasonable to treat it as a 
continuous variable but not as an equal-unit variable. The appropriate 
transformation ought to be one in which the rate of change per year is 
inversely proportional to age. There is the suggestion of a sort of Weber- 
Fechner law applied in the context of time. Karon indicated that if age is 
used in a regression equation it be transformed to a logarithmic function 
and then considered to be related to other variables as a second-degree 
equation, In an illustrative example in which he used data from the effects 
of segregation as measured by a projective test, Karon showed that there 
was a remarkably high degree of correspondence of the fitted second- 
degree equation to the observed means of arrays. f . 
Lord (1962a, 1963) considered two cases in which the multiple-cutting- 
scores rather than the composite-score procedure was applicable. The 
assumptions as well as the mathematical model were substantially altered 
from the first to the second case. In both cases, the general conclusion was 
that when multiple-cutting scores are used with relatively unreliable pre- 
dictors, one would probably do well to allow a high value on one 
A ST low value on another. 
predictor to compensate at least partially for a i. McQuitty 
i” Additional References: Hall (1963); Hoffman ü 3 y 
(1961; 1963a, b); Tucker (1963) ; Watley (1964). 
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CHAPTER VIII 


A Short Evaluation of the Research Reviewed 
in Educational and Psychological Testing 


WILLIAM B. MICHAEL 


This issue affords the first opportunity for a second evaluation of research 
in any one of the title areas of either a three-year or five-year cycle of the 
Review. In February 1962, Harris wrote the first evaluation of the research 
that had been treated in the chapters of an issue of the REVIEW in the issue 
devoted to educational and psychological testing. In what may be viewed 
as a rather pessimistic assessment of the status of research in educational 
and psychological measurement at the end of 1961, Harris considered a 
number of points, among the most noteworthy of which were (a) the 
proper role of factor analysis in test development, (b) the relatively static 
level of predictability of tests of general and special aptitudes, (c) the lack 
of any new principles being proposed for the development of achievement 
tests as reflected, for example, by the absence of any influence of the 
Taxonomy of Educational Objectives (Bloom, 1956), (d) the omission o 

any discussion in the chapter concerned with achievement testing of the 
process or strategy of problem solving in concept attainment and achieve 
ment, (e) the utilitarian emphasis on validity coefficients in research on 
testing at the expense of interest in questions concerning why predictor 
variables behave as they do, (f) the injection of creativity as a new 
criterion which, with its inherent social significance, may pose the danger 
that an undue amount of effort in testing will be directed toward “how to 
maximize the prediction of this new criterion," and (g) the failure © 
the reviewers in the other chapters to indicate what, if anything, in meas 
urement theory had changed as a result of work over a period of three 
years. In indicating that actually part of the problem for lack of concern 
of chapter reviewers with measurement theory could be attributed to the 
absence of publication of any comprehensive work, Harris strongly hinte 

that the assessment of change or growth was an essential topic in the theory 
of measurement worthy of extensive treatment, 

Harris’ Points of concern serve as a partial basis for the generation 
of categories relative to which the research contributions reviewed in this 
issue may be evaluated, although substantial modifications are necessary 
The principal dimensions about which the evaluation of the research 
reviewed in this issue will be made include (a) the use of factor analysis: 
especially with respect to intellectual development and creativity, (b) the 
utilitarian emphasis in relation to prediction, (c) advances in achievement 
testing, (d) trends in measurement of noncognitive characteristics, an 
(e) advances in measurement theory. 
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Use of Factor Analysis 


In Chapter II, Millman and Glock appeared to agree with Harris' impli- 
cations that if theory is not given a primary role and if empirical studies 


are not allowed to furnish both positive and negative answers, then factor 


analysis not only can turn up entities with specious psychological mean- 
ssentially random process for generation of 
components that can be interpreted in a forced fashion as belonging to a 
theory. These writers shared Humphreys? (1962) opinion that many of 
Guilford’s tests are measuring highly specific abilities of questionable 
psychological importance and that the factorially pure tests have not been 
useful in the prediction of complex criteria. Moreover, in their virtual 
rejection of factor analytic models of intelligence as useful bases for under- 
standing the nature of intellectual organization. they seemed to favor 
placing reliance upon the developmental theories proposed by Piaget and 
his followers. 

On the other hand, in taking an almost opposite position in Chapter III 
to that set forth by Millman and Glock in Chapter II, Merrifield strongly 
Supported the significance of Guilford's contributions to the understanding 
of intelligence and creativity as well as the promise of the structure-of- 
intellect model as an informational theory of learning. It appears to the 
writer that Millman and Glock have been somewhat overly pessimistic 
regarding the potentialities of factor analysis as one basis for the under- 
standing of the nature of intellectual functions. Admittedly, although the 
level of confidence with which one may make an inference concerning the 
existence of a psychological ability, function, or process from use of factor 
analysis may not be so high as one might desire, the careful design of 
factor analytic studies in which overdetermination of the structure is 
encouraged with respect to those measures employed to duplicate the 
hypothesized processes should serve to strengthen the probability of an 
accurate inference. Moreover, the factor analytic approach also permits the 
generation of new hypotheses to radictory or unexpected 
groupings of correlated variables. In addition, frequent replication of 
studies should furnish additional information concerning the tenability of 
research hypotheses and of their consistency Within the theoretical frame- 


work from which they were derived. . - 4 
It should also be possible to combine factorial investigations toa “a 
tage with a traditional methodology of developmental studies of intelli- 
gence of the sort that Piaget and his students have followed. bs 
analytic approaches could serve not only to confirm ipee ea = 
stages of intellectual development but also to extend and enrich the con e 
of these developmental formulations. . 
As Millman end Glock suggested in Chapter I, Tucker's (1963b) new 
affords a potential means 


model for analysis of three-dimensional matrices tial n 
s ame individuals 


for investigation of interrelationships of abilities for the s 
This model should be useful in the study of the 


93 


MEN rere e ER EA 


ings but also can represent an e 


account for cont 


on numerous occasions. 


Review oF EDUCATIONAL RESEARCH —— Volume XXXV, No. 1 


influence of periodically introduced changes in the environment on = 
creative potentialities of individuals. Along with the use of T ucker E n : 
the potential link afforded by relating Guilford’s structure of intellect o 
each of the distinct stages of intellectual development set forth by Piaget 
and his followers may well be extended in a systematic way in order that 
clarification of the psychological characteristics of abilities involved at 
each of the stages of intellectual development may be effected. 

From the standpoint of test development. many positive outcomes can 
be realized from the judicious application of factor analytic techniques. 
In short, as factor analysis serves to clarify the nature of psychological 
constructs, substantial modifications in test development as well as im- 
proved representations of psychological processes should be expected. In 
particular, relatively pure measures of single functions or traits can be 
rather readily constructed, although their predictive validities may be 
somewhat low. . 

In the development of test batteries. there appear to be limitations in 
the use of pure factor tests relative to the realization of a maximum degree 
of predictive validity for a composite of predictors. In light of the complex 
nature of criterion tasks that involve simultaneous use and application of 
more than one ability, it would not seem reasonable that batteries of 
factorially pure tests would necessarily predict quite so well as those 
batteries derived from use of a traditional multiple-regression analysis. In 
such an approach, predictor variables may be found to represent a com: 
bination of both factorially complex tests and occasionally factorially pure 
tests. In factorially complex tests the abilities measured can be expected 
to interact much as they would be expecte 


d to do in the performance of 
most criterion tasks. 


Utilitarian Emphasis in Relation to Prediction 


The concern which Harris expressed three years ago regarding the 
utilitarian emphasis on predictive validity coefficients might well turn 
into a nightmare if shortly before retiring some evening he should rea 
certain chapters of this issue of the Review. Indeed. in Chapter V the 
numerous studies which McCall cited concerning the measurement 9 
vocational interests were strongly predictive in their emphasis. There 
appeared to be little concern in the contributions reviewed for a theory 9 
vocational interest or choice, although an extremely limited amount ° 
questionable research was done in the area of construct validation. McCall 
ably pointed out the lack of much concerted effort to relate interest meas 
ures to underlying motivational and learning variables in human behavior 

This same emphasis on predictive validity and on the citation of co 
efficients of correlation among test variables and criterion variables Was 
amply clear in the studies mentioned both in Chapter VI on personality 
measurement by Gaier and White and in Chapter IV on achievement test 
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ing by Bligh. What perhaps is most distressing is that during the past 30 
jets relatively little gain has occurred in the observed onu. of 
oo jr of tests or of batteries of tests employed in the predic- 
"à j of success in the academic setting, in the civil and military services, or 
Se bo Industry despite the rather extensive attention that has 
gh o the criterion problem. 

Harris’ expressed concern that creativity 
criterion toward which a disproportionate ‘effort in testing and prediction 
would be directed has not materialized during the past three years 
possibly because of the relatively conservative nature of most teaches 
and administrators in the educational setting as well as of personnel officers 
in the industrial and business world. Perhaps, in academic prediction, the 
Pro ael on relatively immediate utilitarian values that reflect well-estab- 
aS patterns of conforming behavior and convergent thinking in the 
customarily employed criterion measure of grade point average rather than 
a concentration on the development of a dynamic theory of learning and 
teaching with its promising implications for an improved description of 
the criteria of academic success has served to perpetuate the status quo 
in the obtained magnitudes of the predictive validities for tests of general 
and special aptitudes. Indeed, the rate of growth of statistical sophistication 
at seems to be an almost asymptotic limit reached 
s. Future progress would seem to accrue 
asingly complex techniques of 
hat will serve to explain why 
ademic settings. 


itself might become the new 


has far outstripped wh 
by predictive validity coefficient 
not so much from the development of incre 
Statistical analysis but from bold new ideas t 
individuals behave as they do in a variety of ac 


Advances in Achievement Testing 
egative comment that 


e degree Harris’ n 
xstruction of achieve- 


fluence on the cor 
IV. At least two studies were 
n made to duplicate in test 
Again, contrary to Harris’ 
tioned during the past 
el cognitive 


One bright note to dispel to som 
Bloom’s Taxonomy had had no in 
ment tests was visible in Bligh’s Chapter 
cited in which a conscientious attempt had bee 
construction the objectives of the Taxonomy. 
experience, there were several other studies men 
three years in which problem solving and other higher lev 
processes were important objectives to be evaluated in achievement tests. 
Thus, it seems that at last the Taxonomy is beginning to exert either 
directly or indirectly some degree of influence in the construction of 
objective achievement examinations. It may also be hoped that Mayo’s 
(1961) careful efforts to establish tbe minimum levels of competence in 
measurement which are needed by teachers may actually influence the 
practices of these obviously important individuals who are indeed in the 


key position to improve evaluation practices. 


Although, as reported in Chapters I and IV. 
] gains in test construction. 


several investigations in- 
in administration and 
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scoring, in norming and interpretive procedures, and in the fa ilitation : 
comparisons of new forms of tests with older forms were sca 
designed and carried out. there seemed to be no major s gnificant ad- 
vances or strikingly original contributions to research. As mentioned in 
the previous paragraph. the increased attention to objectives reflecting 


psychological processes has been a hopeful sign which suggests that indi- 
viduals involved in the construction and utilization of achievement om 

inni T *tedlv 
are beginning to emerge from the doldrums that Harris so pomtec^ 


mentioned. 


Trends in the Measurement of Noncognitive 
Characteristics 


Just as it appeared that there were few significant. advances in the 
measurement of vocational interests, there was an absence of noteworthy 
progress in the research reported by Gaier and White in Chapter yi 
regarding the measurement of personality. Although there was a substantial 
amount of systematic effort expended in factor analytic studies. often al 
the expense of relating the definitions of the traits isolated to a theoretical 
framework, it may be that the computer simulation of personality wil 
constitute an important avenue for basic research during the next few 
years. Despite the wealth of research on response sets, the basic issue 
concerning whether such sets reflect a stvle of responding to the individual 
items instead of reactions to their specific content was not resolved. Of at 
least moderate significance in the research reviewed were trends associate’ 
with (a) the attempt to clarify the meanings of the constructs of self- 
concept and body image and to show their relationship to personality 
development in conjunction with attempts at their measurement. (b) the 
formulation of a multidimensional interpretation of the concept of anxiety 
and the development of measures to take into account situational variables 
including both the task and the examiner, and (c) continued direction ° 
effort toward achieving improved definitions and measures of a family o 
constructs associated with need achievement— especially in relation to the 
presence or absence of anxiety characteristics, i 

Although no attention in Chapter VI was given to Krathwohl, Bloom 
and Masia’s (1961) extension of Bloom’s Taxonomy to the affective 
domain, brief mention was made of it by Moughamian in Chapter 7 
The schema afforded should offer teachers an important frame of reference 
in their writing test items for affective objectives in the curriculum. One 
can only hope that the lag between the appearance of this significant 
volume and its translation into active classroom practice will not be 5° 
great as that which arose in the instance of the Taxonomy in the cognitive 
domain edited by Bloom (1956). Certain individuals who are predispose 
toward an attitude of cynicism with respect to educational measureme" 
may well point out that if the delay in applying knowledge from e 
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no a the cognitive domain was of the order of 6 to 8 years, the 
= T A Un "d more difficult. affective domain may well run from 
"slide years. Even a short lag in the application of the principles set 
orth would be an unfortunate loss to the evaluation of important objec- 
tives in the learning process. 


Advances in Measurement Theory 


á A consideration of the contents of Chapter VII, by Mayo, suggests that 
uring the past three years the relative emphasis on applications of 
Statistical methodology to problems in item analysis and selection, to 
estimates of test reliability, and to the development of norms has been 
more Practical than during previous three-year cycles. However, in view of 
his elimination of consideration of the topics of factor analysis and scaling, 
which were covered in the December 1963 issue of the REVIEW, and of 
the material treated by Tucker (1963c) in the Annual Review of Psy- 
chology, and in light of the possibility of the operation of selective per- 
ception on Mayo’s part, the emphasis portrayed may be unintentionally 
somewhat biased toward a practical orientation. 

Harris’ observation that during a three-year period from late 1958 to 
late 1961 little concern was given to measurement theory may have been 
sustained to some extent during the last three-year period. However, there 


have been at least three significant developments that should be mentioned. 
Particularly noteworthy was Problems in Measuring Change, edited by 
Harris (1963), which contains the contributions of distinguished partici- 
Pants at a three-day conference beginning on April 30, 1962, at the Uni- 

]l be found the one 


Versity of Wisconsin. In this collection of papers wi 1 
of three-way matrices for 


by Tucker (1963b) concerning the analysis ) 
measurement of change, to which previous reference has been made. This 
valuable book should fill an important gap in measurement theory, the 
Significance of which Harris clearly anticipated in his evaluative chapter 
in the February 1962 issue of the REVIEW. 

Another noteworthy contribution was the monograph by Tucker 
(1963a), Formal Models for a Central. Prediction System. His objective 
was to develop a system for predicting the performance of students. ata 
number of different colleges when allowances were made systematically 


for differences in secondary sc ell as for introduction of 
res. Tucker's work represents à highly 


new variables such as test sco he 4 
sophisticated extension and improvement of the objectives which Bloom 
and Peters (1961) presented in The Use of Academic Prediction Scales 

for Counseling and Selecting College Entrants. . 
Ghiselli’s (1964) Theory of Psychological Measurement constitutes the 
ement. It may be expected 


third major contribution to the theory of measureme! : 
that the teaching value of this text will be a relatively greater influence 
h practices. One may hopefully anticipate 


than will be its impact on researc 
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that it will encourage competent. undergraduate majors in psychology as 
well as graduate students in both psychology and education to direct their 
efforts toward further studies in measurement theory. 

What impact the volumes edited by Harris (1963) and prepared by 
Tucker (1963a) will have upon educational and psychological measure- 
ment and especially upon its practice remains to be seen. The availability 
of electronic computers with tremendous storage capacities should make 
possible the relatively easy application of many of the quantitative formu- 
lations that were developed. In short, the advances in measurement theory 
during the past three years seem to have altered the implied lack of 
progress set forth by Harris in his evaluative chapter. 


Conclusion 


During the past three years, the research reviewed in educational and 
psychological testing has shown evidence of being only slightly more 
imaginative in conception, somewhat more rigorous in design. and con- 
siderably more extensive in scope than that reviewed during the previous 
three-year period. Although no major breakthroughs were effected, the 
research studies reviewed reflected (a) continued and expanding use of 
factor analysis, (b) systematic efforts directed toward the study and 
measurement of intellectual development and organization, (c) a concern 
for measurement of process objectives in achievement examinations as 
well as for evaluation of affective objectives in the curriculum, (d) only 
slight advances in the measurement of vocational interests and personality 
characteristics with the possible exception of the potentialities afforded by 
initial research efforts in computer simulation of personality, and (e) à 
modest degree of over-all progress in measurement theory, although the 
contributions to the assessment of change in the volume edited by Harris 
Eun Us D. e publications of Tucker (1963a, b) devoted respec- 

yt elopment of a formal prediction system and to the facto! 
analysis of three-dimensional matrices were particularly noteworthy: 
Welcome as major advances in statistical methodology may be, the critica 
need in educational and psychological testing is nat for additional statisti- 
cal sophistication but rather for the formulation of a comprehensive an 
workable theory of the teaching and learning process to which measure 
ment and evaluation procedures can be meaningfully related. 
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FOREWORD 


It is customary. we know, to compare the states of the field peel 
the previous and current issues of the Review. However. we — n 
to extrapolate the trends. many of which are obvious. to pec s 
expect of research on early childhood when the next issue of the coe 
on this subject is prepared. Some nascent tendencies are worth n i K 
ta) The ties between educational research and practice and basic reseat à 
issues will become closer. In this issue. for example. Williams’ pend 
of research on reading indicates the analysis of this complex process 1n 


m 
basic perceptual and learning issues: Huttenlocher points to some Ó 
research on cognitive development that may provide the research directio i 
for mathematics and science curricula that Leiderman describes. Balam 
ties classroom interaction to psychological ecology. and the extensions i 
Hartup’s review of social behavior to the ubiquitous problems of classroor 
motivation seem obvious. (b) New curricula will continue to be design” 
for the elementary grades, with attention to special groups (see Chapter A 
and for younger than usual age groups. as indicated by Williams 2! 


E E ^ the 
Leiderman. We would hope that these curricula will be subjected to 
intensive in situ observation that Amidon h 
variables. 


A a jona 
as described for socioemot 
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CHAPTER I 


Naturalistic Studies of Classroom Learning 


r 
CLARA P. BALDWIN 


criticisms and cautions regarding the use of na- 
turalistic observations for studying the behavior of children and of their 
teachers, several studies within the past six years have made valuable 
hodology. identified categories of teacher and child 
dy of classroom problems. and yielded find- 
s. The brevity and purposes of this chapter 
review to that research which appears to be 


Despite the justifiable 


contributions to meth 
behavior important to the stu 
ings regarding these problem 
make it necessary to limit the 
most fruitful and most soundly conceived. 


Psychological Ecology 


ssociates, adding the concept of psycholog- 
aturalistic observations, have for 
f children in various behavioral 
behavior of children. Their 
jor and some of the comparisons of settings 
psychological ecology have been 
1954) and by Wright (1960). 


as one of the settings 


. Barker and Wright and their a 
ical ecology to the methodology of n 
he behavior o 


several years investigated t 
a science of the 


settings in order to build 
methods of recording behav 
in which these concepts and methods of 
used were described by Barker and Wright ( 
Wright and others ( 1951) concentrated on the school [ i 
in which all children in a society spend much of their time. They described 
the importance of obtaining observations of classroom behavior and con- 


ditions in order to increase understanding regarding both laws of social 
acteristics of individual pupils. In this paper 


behavior and social char ' p 
ple scoring of classroom behavior of a 10- 


were presented data and sam 
year-old boy from the Midwest study. — 

Kerkman (1963), in his doctoral dissertation completed under Wright's 
direction, described differences in behavior patterns in nine city ( popula- 
tion of about 25.000) classrooms and nine town (a number of small 
i Three classes each at 


villages with populations of about 750) classrooms. : 
the first, third, and fifth grade levels were observed. Classroom behavior 


was recorded with a time-lapse motion picture camera ys ee 

" " a school day. Twenty-one 
recorded one picture per minute throughout the z ol ay I a 
scoring categories were defined and applied to the most visible chil- 
dren—five boys and five ajrls—in each scene. These activities were classi- 
fied into five main categories. with subdivisions: (a) conformance—degree 


to which the children meet the claim of the setting. the claim defined by 
and social milieu make of the child—(b) 


the demands that the physical 
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nonconformance. (c! semiconformance. (d) social components. (e) tn 
sidual base measures—negative affect. good motor tone. ete. A ime 
frame analysis was made on the behavior of each of the 10 subject dfi 
dren, and rater agreements ranged from 81.7 percent to 99.3 percent. bier 
a median of 95.6 percent. Over three-fourths of the behavior cantoni 
to the claim of the setting. while the remaining behavior was nearly equa A 
divided between nonconformance and semiconformance. The four mos 
prevalent behaviors in all schools were sedentary bookwork. aitending x 
teacher, looking away. and communicating with peers. Pupil pia 
showed marked similarities in the different sized communities. Nevert A 
less, organized group participation, attending to teacher. and total teache 7 
child interaction were more prevalent in the city. while autonomous actio 
by the child, heterogeneity of pattern. and engaging of the Papier oe 
curred more frequently in town. In both types of community. indepen ae 
of the child’s activity increased. and teacher-organized activity decreasec- 
with grade level. . " 
Shure (1963) adapted Barker and Wright’s idea of psychological iis. 
ting to the study of the nursery school. Shure's is the only recent naturalis- 
tic study of preschool programs. She divided the indoor area of the nursery 
school into five activity settings: block corner, art area, housekeeping 4A 
story area, and science corner. Using a time-sampling technique. fal 
served these five areas during the free play period for number of chile e l 
using the area (population density), amounts of several kinds of a 
participation, constructiveness, and affect. She found that there were ee 
ferent population densities as well as different kinds of child hehavios t 
the various areas. Her hypothesis was that the different activities O° 
nursery school may, in the language of Barker and Wright, “coerce ce " 
tain kinds of child behavior. Since this study was carried out in only gen 
school and with one teacher, much further work needs to be done » 
document the findings. In light of the upsurge of interest in education? 
programs for young children and the lack of systematic research a! 


Am : i d : vestigia" 
empirical data concerning nursery school education. this line of investi£ 
tion should be extended. 


Achievement and Self-Esteem 


For the past several years, Sears (1957. 1963) has been engaged in x 
large-scale research. program utilizing behavior observations as well il 
other measures to study the achievement and self-esteem of elementary 
school children within their classrooms. She developed measures of € 
study of relationships between classroom variables (teacher behavior 
teacher preferences for school activities. teacher perceptions of indivi n 
children, and peer perceptions and liking for other children) and ip 
educational outcomes on the part of the children (self-concepts, liking - 
other children, task-oriented classroom behavior, achievement test sco™ 
attitudes toward school activities, and creativity test scores). 
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Several findings from Sea (1963) major study, which involved 195 
? children and 7 teachers in the fifth and sixth grades, are concerned with 
the relationships between teacher behaviors and some of the pupil “target 
variables.” Among the findings reported by Sears (1963) are these: (a) 
Children of superior mental ability who received high scores on creativity 
r tended to have teachers who rewarded children individually with personal 
interest and praise. These teachers avoided rewarding by evaluation of 
the children’s performance. (b) Ìn contrast. in classes preoccupied with 
task-oriented work. teachers’ rewards tended to be given in evaluative 
terms. and to the group rather than to individuals. They tended to teach 
by direct presentation of information. (c) Achievement test scores of su- 
perior boys were predicted by such teacher behaviors as rewarding by 
empha punishing by reasoning. and asking for explanations. Sears 
i commented: “Sheer frequency of independent. task-oriented work does not 
better scores on achievement tests for most children 
ed with good self-concepts and/or liking by 
others for the children who are below the group mean in mental ability.” 
Three of Sears's students reported the. development of two of the be- 
havioral measures and data secured through their use. Kowatrakul (1959) 
described the category system used in the point-time sampling technique 
for studying children’s task-oriented classroom behavior. She reported that 
some 80 percent of observed behavior in 59 fifth graders and sixth graders 
was in teacher-approved categories. Children were most gm ongoing 
work” when they were engaged in “independent seat work and “watching 
and listening” activities. There did not appear to be significant ag 
in attentiveness to work going on in different subject TANE eae, Mel “lle 
sixth grade, though subject matter had an effect in the fiith um i Pep 
(1959) utilized two of the categories from this observational schedule 
: : Sanc Er hildren in first and second grade 
compare the industrious behavior of childr S mr i 2 
classrooms with their achievement and work-oriente e. aerei igh 
standardized doll play situation. She found that e » a = Tar rin 
in industrious behavior exhibited consistent kinds of wr ed 
play: children who were intent on classroom us ep e activities. 
frequently depicted child dolls as being Lair ad án ps tay in which 
Spaulding (1961) analyzed teacher-child ome microphone. a radio trans- 
teacher verbalizations were recorded with a a nt vi idt canasta: 
mitter, and a tape recorder and were score 9 


ior i ‘th and sixth grade 
! Spaulding collected data on teacher behavior in 21 d za se 
classrooms in nine elementary sc 


hools of an upper m ; ; 
i »eoriz ows: approval. 
California city. Teacher behavior was categorized bd um — 
; AES ster 1 a i 
disapproval, instruction (managing. structuring). liste ial 
communication with adults, and temporary 


bsence from 
classroom. Using these teacher behaviors (factor-analyzed into 17 factors) 
as antecedent variables and several pupil target 


variables as consequent 
r " s. Some of his major 
variables, Spaulding tested a number of hypotheses. S e j 
(a) the 21 classrooms were signi 


guarantee a. payoff in 
in the sample. but it is associat 


personal activity, 


S ficantly different on 
findings were these: 
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all the pupil target variables: tb) level of self-esteem was related positively 
to one component of teacher behavior. namely “calm, acceptant transac- 
tions in general with private individualized instruction and concern for 
divergency. attention to task. and use of task-appropriate procedures and 
resources”: (c) the category of “businesslike lecture methods with insist: 
ence upon attention to task and conformity to rules of procedure ^ was 
found to be correlated significantly with pupils’ gains in reading ant 
mathematics and with problem-solving performance. There was a negative 
relationship between dominating-threatening teacher behavior and pupils 
gains in reading. Also of interest to those concerned with observations of 
classroom behavior is the book of case studies of individual children from 
this research program. 

In contrast to the studies reported in this chapter which presented group 
data only. a recent book by Sears and Sherman (1964) included data from 
a variety of measures and behavior observations on individual children. 


Teacher Effectiveness 


Medley and Mitzel have been engaged in observational studies as part 
of a longitudinal study of teaching effectiveness. Their first step Was t° 
develop a simple observational scheme which could be reliably used bY 
observers with a minimum amount of training and which would enable 
them to collect data on teacher behavior in a late number of classrooms. 
Drawing on the methods of Cornell. Lindvall, and Saupe (1952) and 9 
Withall (1951), Medley and Mitzel developed a measure known a$ the 
OScAR (Observation Schedule and Record). Medley and Mitzel (1958: 
1963) described the development of this schedule and reported differ" 
ences among 49 beginning teachers scattere j a d 
19 schools in the New 
analyzed to select the vari 


d over grades 3. 4, anc 
York City school system. The data were 
ables that showed reliable differences among 


Hawes : E vel 
classrooms of the different teachers. Those items which discriminated 3 e 
were combined om 


ono 
cher 


into 14 keys on the basis of a priori judgment. 
examples of these keys are problem-structuring teacher statements: aut 
mous administrative groupings, freedom of movement, manifest te4 
hostility, and supportive teacher behavior. Reliability coefficients O” the 
keys ranged from 0.61 to 0.91, A factor analysis of the 14 keys resulte 
in three factors: (a) warmth of emotional climate, (b) degree of V 
emphasis, and (c) prevalence of pupil.initiated activities. 

In another study, Medley and Mitzel (1959) correlated the factor $° 
for these 49 classrooms with five measures of teacher effectiveness: "^ 
adjusted reading growth. (b) growth in group problem-solving skill, > 
pupil-teacher rapport, (d) teachers? self-ratings, and (e) principals’ rating 
No significant relationships were found between measures of classro? — 
climate, verbal emphasis. or social organization (the three factors n 
observational data) and any of the five measures of effectiveness. 
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Summary and Conclusion 


In spite R. CE 

utu he ry he win i il t de 
siren Far example. 26 = dos categories employed are surprisingly 
support. or [me e a pe a approval (sometimes including warmth. 
in all the emi de: ben ; ew (managing. structuring) occurred 
ieee gece te bn = üles of teacher behavior. The child behavior 
mr an Nes seke i i ^» s and Wright were aimed primarily at the 
MBs ber Pia 6 ee 3 : dren’s classroom behavior and their responsive- 
devis Pe eu of the academic classroom. This agreement among 

identifying similar variables may be a good indication of 


the i i 

> ie i of these behaviors 

ie s les 2 it, 

T hang all reported satisfactory observer or 
Mitzel i rper a and rating the behaviors being studied. Medley and 
WERL ard . é pointed to the importance of estimating both observer agree: 
a ste e : l H 
stability of behavior as aspects of reliability: they described 


measures of both. 

M Cr pes of re 

aen ee s newer techniques—one 

pose proble i ne other, automatic photograp 
lems. both of these latter techniques hold promise for future use. 


Medl 

ey a Te aen 

hoped f: al Mitzel noted that. because of background noise in recording 

verbalizations " : 

erbalizations electronically. they found observers to be more 
ght. in à personal communication. commented 


ellecti 
m rim T 
tern of 1E he plita iaraa technique picked up much of the formal pat- 
Boing on. S m unctioning. it lost most of the content of what was 
together sil ni PIDE mentioned having used an observer im the classroom 
of the obser s a radio transmitter, though he did not describe the function 
both the | ipis For simple and efficient on-the-spot recording by observers. 
ie OScAR and the Sears category systems seem very useful for fur- 
ehani ‘r teachers. One 


thet a 

CT studies M s : 
dies of specific kinds of behavior of children or te 

archers will utilize. and build on the work 


mie 

SUEht hope that future rese 

ment regi Ve in all these studies rather than resort to the develop- 
The most E ciel new &ystents. 

tions hei : striking. feature of th 

Comes of on descriptions of teacl 

; of the educational proces 


industry 

ry and self-esteem are relate 
rievement anc 
lassroom vari 


rater agreement in 


itional observational pro- 
used neck microphones with radio 
hic recording. Although they 


cording included the trad 


ese studies is the failure to find cleat rela- 
asures of the out- 


ing behavior and mea 
as fantasy measures of 


d to pupils’ classroom behavior. but 
1 growth in skills are not con- 
ables employed in these 


a 
Variables such 


such vari 
xn LM as academic acl 
Stently predicted 
AND. ed by à 
studies, by any of the € 
One 
jns problem certainly 
er-chj P, o 
used 2 child classroom interactions. 
Siricisd observational studies on botl 
i at 1 H H 
almost primarily to social-emotional 
st no studies of the subject ™ 


ed to describe 


asures being us 
ors presently 


behavi 
Idren have been re- 
There are 
ild inter- 
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actions or of the way teachers actually present new information. answer 
questions, or encourage a search for underlying principles. These behaviors 
should relate to pupils’ cognitive development. and the fact that sp 
can validly judge their own skill in presenting subject matter (Medley anc 
Mitzel, 1959) giv hope that aspects of cognitive development can be 
detected in observations of teacher-child interactions. 

But even though the variables employed in present studies are not ex- 
plicitly cognitive, it is still a surprise and a disappointment that they do 
not relate to educational outcomes. Does it really make little difference 
whether a teacher is warm or hostile? Is a child's industriousness in 
school (high for many pupils. according to both Kerkman and Kowatra- 
kul) actually unrelated to academic growth? At the moment. the data can 
be so interpreted, but such an interpretation does not fit with the Mm 
general theories of child development that are currently finding empirica 
support. For example. the role of warmth between children and adults in 
the child’s imitation of adult behavior (Bandura and Huston. 1961 ] an 
in the development of conscience and acceptance of adult standards (Sears: 
Maccoby, and Levin, 1957) suggests that warmth facilitates the child's dë 
quisition of information. These studies indicate that a warm teacher-chile 
relationship is an important but not sufficient prerequisite for optima 
cognitive development. On the other hand. a global warmth or approva 
that does not discriminate between those child behaviors that are approve! 
and those that are disapproved might well not show any facilitative effect 
on cognitive development. Thus it may be that pupil growth in cognitive 
skills will in future studies be related to the interaction of social-emotion? 
and cognitive variables in teacher-child interaction. Sears. (1963) indi- 
cated that teacher behaviors have different effects on children of differing 
mental abilities and sex. Such data point to the importance of considering 


i s : 4 A z " 35 
the effectiveness of certain teacher behaviors in relation to specific grou] 


of children within total classrooms, 
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CHAPTER II 


Children’s Intellectual Development 


JANELLEN HUTTENLOCHER * 


Within the past six years, the period with which this chapter is con- 
cerned, children’s intellectual development has been studied extensively- 
The brevity of this chapter precludes even summary treatment of the 
entire field. Therefore. only two topics have been selected for discus- 
sion, and even these are treated very selectively. The child’s acquisition © 
his native language will be treated first. followed by the development of 
his thinking processes between the age of four or five years (after he has 
mastered the essentials of language) and the beginning of adolescence 
(when he is generally believed to function as an adult). 


Acquisition of Language 


The acquisition of language is a central feature of the child’s intellectual 
development during the preschool period and represents an achievement 
of considerable complexity. The young child determines from the speech 
around him the rules that govern its usage so that he is able to understant 
and to produce well-constructed utterances. Systematic consideration ° 
this achievement is one of the major recent advances in the study of chil- 
dren’s intellectual development. While earlier studies measured mastery 
of language in terms of vocabulary size and sentence length. attention has 
now become focused, in the words of a recent conferens. on “acquisition 
of language-as-it-is-described-by-linguistic-science” (Bellugi and Brown: 
1961). This work explores the evolution of syntax, tracing the child’s im- 
plicit rules for combining words from the earliest multiple-word utterances 
until the period when the child’s speech approximates that of an adult. 

At the age of about one year the child makes his first meaningful utte™ 
ances. These are generally single words (daddy, light) and occasionally 
short phrases used as single words (gimme). They designate object: 
people. or complex situations. Many more single-unit atteyances occur I" 
the following months, and at about 18 months these units are first CO?" 


z : ; Dia j enis 
bined. By the time the child is three and a half, his speech is very simila 
to adult speech. 

" «and 

* The preparation of this chapter was supported in part by grants MH-06626-03 "por 


MH-05120-04 from the National Institutes of Health to Harvard University Cente 
Cognitive Studies. 
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Development of Syntax 


» 
Several observers have recorded the earliest combinations of words 
(Braine, 1963; Miller and Ervin, 1961; Brown and Fraser, 1961). The 
latter two articles have traced development throughout the succeeding two- 
^ year period. The first two-word utterances have a characteristic form 


noted by all observers. They generally include words in two classes: One 
class contains few members that are used with high frequency (e.g.. see, 
where). These have been called operators or pivots. The other class in- 
cludes all other words. Once the child combines one operator with another 
word, he often begins to combine it with a host of words (e.g. “ 
). For a brief period thereafter. all the child's speech 
ases. Over the next months such expressions 
xample. some steps in the progression 
of a child's description of a cat lapping milk might be “milk,” then “kitty 
drink.” then “kitty drink milk.” and finally “the kitty is drinking his 
milk.” At any one of these points in speech development, all of his utter- 
ances are of similar length and complexity. Whether the child is express- 
ing himself or attempting to imitate an adult. the form of his speech is 
rather similar; the child who says “kitty drink” spontaneously is apt to 
“the kitty is drinking his milk” as “kitty drink.” 
early forms of speech are function words, verb 
í The child relies heavily on con- 
he order in which they occur in 


doggie,” “see light” 
consists of such two-word phr: 
n gradually become elaborated. For e 


imitate his mother's 
Consistently omitted in the 
and noun inflections, and auxiliary verbs. 
generally producing these in t 


tent words 


adult speech. : 
As the child begins to use plurals, to vary his verb forms, and so on, 
the types of mistakes he makes indicate that the rules the child follows are 
encompass exceptions. For example, irregular 
an overgeneraliza- 


overgeneralized and do not T 
H “I goed, 


verbs are regularized, as when the child says 
| tion of the rule for forming the past tense. 
The early forms of speech are far from a r vedi 
output. Rather, those words are included which convey the prd em vs 
of a message, and thus this speech has been described as te os si 
Brown and Fraser (1961). Although the very youns child pe Se bis 
mastered the entire set of rning what he confronts. his gn 
reductions result in fairly gooc ion. Brown gel pochi E 
pointed out some factors that the child manages | 


i utterances. First. 

" extract the most meaningful wor d 
h Š ii f: oy may have prac- 

these content words are the type of words that the mother may have pra 


ticed with the child one at a time. For instance. snos en p - 
paired with objects: the mother picks up an (€ pota Mes A 
, Familiarity with a word may make it easter 10) the: € ipe n 
word from the context of a sentence and might be A dean si 
more likely to use it himself. Further. as Brown anc p z w mak hs 
content words receive the heaviest stress by adult spent e a + y Fw 
casiest to hear. Brown and Bellugi also gonsidere l the prove DEMS PAS 
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andom selection from adult 


rules gove 
1 communicat 
may explain how 
ds from fairly complex 
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child’s telegraphic speech becomes elaborated. Possibly. constant exposure 
to adult speech is sufficient for the child to elaborate his own: these 
authors. however. pointed to a further possible mechanism. Mothers. they 
noticed, tend to imitate the speech of their children. but in an elaborated 
form. If the child says “kitty drink.” the mother is apt to say “yes. the 
kitty is drinking his milk.” Such expansion by the mother may be im- 
portant. because it provides a close comparison. between the child's utter- 
ance and the adult form of that same utterance. 


Other Considerations 


Certain more general theoretical issues in language development have 
recently concerned psycholinguists. Several writers have expressed their 
belief that the general principles of association, reinforcement. and gen- 
eralization derived from learning experiments with animals are inadequate 
to explain the important facts about language (Chomsky. 1959; Lenne- 
berg, 1964; Miller, 1961). Learning theorists have claimed, for example, 
that children acquire language through imitation. Imitation. while clearly 
a factor, hardly seems a sufficient explanatory principle. The child does 
not imitate adult speech as he would a sequence of random items. When a 
random sequence exceeds the memory span. the middle items tend to be 
forgotten, and the first and last items tend to be retained. Instead. as We 
have seen, the child's reduction of adult speech efficiently extracts the most 
highly informative items. Further, the child's usage at each point in his 
development is quite systematic and can be described as following a set 
of rules. S 

_ The consistency of children's speech development, as well as the great 
similarity in the form of early speech in different children. has suggested 
to certain authors that the nature of the human nervous sysièm should be 
morati de eain a anaes dri Ther e 

8 uilt-in structure of the information 
processing system that enables the child to learn the complex rules ° 
pemer ME 1964; Miller, 1964). Lenneberg oy argued that 
speech should not be considered purposive b i iad. gain, 
but rather an inevitable Do rens ca ea Pecan 
organism. ua 

In a book written about 30 years ago but only recently available in 
English, Vygotsky (1962) discussed the evolution of E in the child's 
thought. In early childhood, he maintains. language eies mainly socia 
functions; later, it becomes internalized as the instrument of thought. In 
a recent paper. Bruner (1961) discussed the modes of thought preceding 
the use of language: enactive—the internalization of sensorimotor acts— 
and iconic—internalization of images. These ways of representing the 
world, Bruner notes. are far more limited than the symbolic—language— 
mode. When language is internalized. the possibilities for manipulation and 
for consideration of hypothetical occurrences are vastly increased. 
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Cognitive Learning Between Ages Four and Twelve 


While the child rapidly attains the rules of grammar, he grasps more 

slowly many other lawful relationships, such as certain general physical 

principles. Psychologists and educators are currently concerned with a 

^ related group of questions about school-age children. How much and what 
do children understand at particular ag s? How much more can they be 

taught at these same ages? In what ways can information best be pre- 


sented to them? 


Piaget's Position 
rew heavily on the theoretical 


f his published work predates 
is concerned. his impact on 


Recent discussions of the above problems d 

and empirical studies of Piaget. While much o 

the six-year period with which this chapter cerne 

thinking in this country is clearly contemporary. dating in large part from 

the English translation of The Growth of Logical Thinking from Childhood 

to Adolescence (Inhelder and Piaget. 1958). A detailed book-length sum- 

mary of Piaget's contributions has just appeared (Flavell, 1963), pes other 

summaries have been published recently (Hunt, 1961; Wallach, 1963). ; 

Piaget suggests that the child goes through a series of stages m his 

progression toward intellectual maturity- During each stage. the child has 

characteristic ways of viewing the world. That is, his thinking has a D» 

ferent structural organization at various points in his development. During 

each stage, the structure of the child's thinking for a time assumes a stable 

form. Further, the principles of organization during each stage are sid 

sidered quite pervasive, characterizing the child's mode of functioning in 

a wide variety of situations. . 

The age period of concern here, from age four or five gan pe 

F cence, spans two major distinct stages within Piaget's —€Ó mw of 

preoperational, lasts until about seven years The second, T ae E 

concrete operations, emerges at about seven years and wae d that the 

| start of adolescence. In the preoperational stage. it is eee objècts, 

child lacks understanding of certain basic relationships ge desit or 

For example, he cannot order them along a dimension, a= ite changes 

weight, and he is unaware that quantity remains pan 7 "er because 

in spatial arrangement. The inability to: conserve quen zs are reversible. 

the child does not understand that spatial rearrangemes ty of situations: 

Lack of conservation has been demonstrated x ice apa 

e.g., if two equivalent groups of a ae id dad e recognize that the 

out and the other closely clustere Ja the ehi learned to conserve 

number of objects is equivalent. Even after h E ie empiied into: axe 
number, he fails to conserve volume. If water 1n a ja 


i z t has changed; if the new 
iffer » child says the amoun Hs ub 
fale alm PE he believes there is more water; if it is shorter 


jar is taller and narrower, 
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and wider, he believes there is less water. The child at this stage places 
great reliance on immediate pereeptual clues; one arrangement "looks 
like” more. 

In the following stage of concrete operations. it is claimed. the child 
can carry out the operations of ordering and reversing and so can under- 


stand relationships between objects. but he still lacks the ability to formu- 
late propositions and consider possible re 


tionships between them: 1e 
al possibilities in a systematic 


he cannot specify in advance all hypothetic 
fashion. 


Piaget and Current Approaches to Thinking 
Piaget's theory, while formulated in large part some 30 years ago- has 


much in common with more recent approaches to adult cognitive function- 
ing. Newell, Shaw, and Simon (1958) suggested that we treat the problem 


solver as a “system for information processing" in pos ession of memories. 
operations for acting on memories and incoming stimulation. and rules for 
combining operations into larger programs for processing information. 
Bruner, Goodnow. and Austin (1956) have examined in detail the various 
strategies (a word essentially synonymous with programs) used in obtain- 
ing answers to certain concept-formation problems. Miller. Galanter. ant 
Pribram (1960) suggested that people's activities be examined in terms 
of the plans (also synonymous with programs! that guide them. Piaget's 
work can be described as an attempt to specify this information-processing 
system at various points in its evolution. f 

Recent studies of intellectual development not closely related to Piaget's 
work likewise concern the evolution of systems for processing information- 
For example, the above discussion of language development indicates that 
the child goes through a sequence of stages, possessing at each point 
gram for forming utterances. Early programs are both simpler an eu 
ferent from later ones. The evolution of strategies for grouping objects was 
studied by Bruner and Olver (1903). Young children tend to group object? 
by a common characteristic that relates the last object is: the one im- 


a pro- 


mediately preceding. Finally, after adopting certain intermediate strategie 
older children group objects into a superordinate class that includes "c 
a 


members. Huttenlocher (1961) found that children approach certain form 


À ; " o 
reasoning problems with the same strategy as adults, but may fail t 
execute the entire sequence of operations required. 
Experience and Cognitive Learning 
" 2 R wa if 
The sources of progression toward maturity are somewhat obscure va 
Piaget’s writings. While the importance of the child's interaction with AT. 
+ . . . ic yar 
environment 1s stressed. the rough designation of the ages at whic h a 
i 


ticular modes of functioning appear suggests limits to educability Y 
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. In his experiments. Piaget tested the child’s reactions 


particular age ranges. 
This procedure reflected 


to certain problems but did not attempt to teach. 
his belief that the child’s mode of functioning at each stage of his develop- 
ment is stable and not easily altered. 

Recent experimental attempts to tr 
use of concrete operations have had 1 
Wohlwill and Lowe (1962) attempted to train children to conserve num- 
ber. Different groups were trained (a) in counting elements before and 
after spatial rearrangement or (b) in adding or subtracting elements to il- 
lustrate those operations that do change number or (c) in dissociating 


number from configuration by arranging 


ain the preoperational child in the 
-ather limited success. For example. 


and rearranging elements. A 
as a control. All groups improved somewhat (includ- 
but there were no differences among them. Smeds- 
training children to conserve 
asticine ball that changed shape. He also found that 
of what he called “logical nece y." When. 
he tricked those children he had 
ready to renounce the. principle 
n who came into the experi- 
The training sessions in 


fourth group served 
ing the control group). 
lund (1961) likewise had limited success 


quantity given a pl 
training did not convey a sense 
by secretly removing some plasticine. 
trained to conserve quantity. they were 
as not true of childre 


notion of conservation. 
brief. It is not clear whether these 


acquisition of the general 


of conservation. This wa 
ment in possession of the 
such experiments were comparatively 
procedures should be regarded as suflicient for 
notion of conservation. 

Hunt (1961) discussed at length the manner in which experience may 
affect intellectual development. He maintained that the rapidity with which 
the child proceeds through Piaget's stages of development depends on the 
richness of his environment. His conclusions were drawn largely from ex- 

at the amount and 


periments with animals. For ex been found th 
tion to which young animals are exposed affects markedly 


nature. Hunt. also cited experimenls on the 
These experiments showed that after a great 
blems. primates become highly efficient in 
e same type. Their performance indicates 
acquisition of a systematic mod approach or strategy. Such strategies 
evolve only after fairly lengthy training. Perhaps the training provided by 
the child's natural environment is often too haphazard or not sufliciently 
prolonged for efficient development of those strategies of which the child 
is actually capable. This point of view suggests that the young child may 
be limited by ignorance of adul approach rather than by his 


anatomical or physiological deve 
In The Process of Education, Bruner (1960) discu 
ness to learn. He suggested that, in the absence of evide 


one assume that judicious choice of the mode of present 
to children of any age- Pointir 
formal symbolic 
l a spiral curricu 


ample. it has 


variety of stimula 
their ability to learn when 1 
development of learning sets. 
number of similar learning pro 


dealing with new problems of th 
e of 


t modes of 


Jopment. . . 
issed the child's readi- 


nce to the contrary. 
ation may permit 
; to the primary 
tion, he advocated 
lum in which the 


one to teach anything 


school child's difficulties with 
He sugg 


presenta 


estec 


use of concrete materia 
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same material is presented at different ages with demonstration gradually 
replaced by formal argument as the child matures. 

Perhaps, within the next few years, the present joint efforts of educators. 
specialists in particular disciplines. and psychologists will do a great deal 
to resolve our present uncertainties about children’s potential intellectual 
achievement. 
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CHAPTER III 


Social Behavior of Children 


WILLARD W. HARTUP 


Social behavior consists of activity elicited by stimuli emanating from 
people or activity which. in itself, possesses stimulus value for people. 
More than half of the literature in child psychology published since 1958 
is concerned with some aspect of social behavior. Two areas of research 
have been chosen for review in this chapter: (a) the effects of social 
reinforcement on children's behavior and (b) imitative behavior in chil- 
dren. Obviously, only selected trends in the methodology and findings of 
research in these areas can be indicated. 


Social Reinforcement 


Praise, approval. verbal demands. and reproof are well recognized as 
the stock-in-trade of every parent and every teacher for bringing about 
changes in the behavior of children. Nevertheless. a. systematic body of 
knowledge concerning the consequences of social reinforcement. has been 
late in emerging. Recent research consists largely of direct. empirical 
attempts to understand how children behave under various contingencies 
of reinforcement by adults. Formal theory has not played a very large role 
in shaping this research. Existing data clearly indicate that social forms 
of reinforcement are complex types of reward. in that numerous condi- 
tions determine the effectiveness of social rewards in a given situation. 
As yet, however. psychologists have not developed a parsimonious set 0 
integrating principles (a theory. in other words) which will effectively 
serve to organize and clarify the rapidly accumulating empirical evidence. 
Thus. Horowitz (1963) suggested that a reader of this literature MAY 
well feel that “while much is known. little is understood.” 

Most of the recent studies of children’s responsiveness to adult reinforce 
ment have utilized simple. short laboratory tasks. Factors influencing the 
effectiveness of social reinforcement have been observed most frequently 
with such tasks for the child as dropping marbles through holes anc 
pulling levers. Much has been learned by observing children's performance 
on these simple tasks, but the degree to which the findings may be gen 
eralized to life situations is restricted. Also. most of the recent research 
has involved adult reinforcement of children's behavior: little is presently 
known about such significant phenomena as children's reinforcement. 
adult behavior or of peer behavior. 
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Demographic Factors 


Characteristics such as the child's sex. chronological age, mental age 
and social class background have been investigated as factors affecting 
responsiveness to adult praise and approval. The sex of the child in 
relation to the sex of the adult who dispenses reinforcement, has been 
found consistently to be associated with performance in socially reinforced 
tasks. As a case in point, Stevenson (1961) reported that older preschool 
and younger elementary school boys performed better than girls on simple 
tasks when praised by a female adult. while girls performed better than 
boys when praised by a male adult. Similar results were reported by 
Gewirtz and Baer (1958b) for nursery school children and by Stevenson 
and Knights (1962) for retarded children. There is indication that this 
cross-sex effect extends beyond the preschool and middle childhood years. 


Stevenson and Allen (1964) reported that college-age women had higher 
asks when praised by men than when praised 
ates responded better when praised by 


rates of response on simple t 
by women. while male undergradu 
women than by men. 

Clear individual differences exist among 
rein forcers of children’s behavior. For example. Stevenson (1961) reported 
significant individual experimenter differences in reinforcing effectiveness 
when 12 adults were employed as experimenters. Individual differences 
tended to be more marked among female than among male experimenters. 
Women who were effective reinforcers of young children’s behavior tended 
to be less effective with older children and vice versa. On the other hand, 
the men who tended to be effective with young children tended also to be 
effective with older children. The personality and behavioral characteristics 
of adults which make them effective reinforcers of children’s behavior have 
proved difficult to isolate. and definitive results have yet to be published. 

Age differences in children’s performance under praise, criticism, and 
passive attention from adults were studied by Stevenson and Cruse (1961). 
The data suggested that social reinforcement was more effective for E 
year-olds than for 12-year-olds. but both groups did better under social 
reinforcement than under passive attention. The results indicated that the 
older children may have been uninterested in the simple marble-dropping 
task employed in ‘the study; however. under criticism. the older prs 
performed vigorously while the younger children quickly gave up. p - 
concerning the relation between mental age and socially reinforced La 
formance indicated that responsiveness to praise and approval is gie 
related to mental age up to an intellectual level between five m a 
years. However, for children above this mental age level, no d 3 m 
apparently exists (Stevenson and Fahel. 1961: Stevenson and Snyder. 


1960; Zigler, 1963). 

Social class differences were 
who found that praise was less € 
were adult statements relating to the correctness 


adults in their effectiveness as 


este 


reported by Zigler and Kanzer (1962). 
flective with middle class children than 
of the child's performance. 
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On the other hand, lower class children performed a simple task better 
when praised by an adult than when they were just told they were correct 
or right. 

Other data on subject and experimenter characteristics which influence 
children's performance in socially reinforced tasks indicate that unfamiliar 
adults are generally more effective in influencing children's performance 
by means of praise and approval than are parents of either sex (Stevenson. 
Keen, and Knights, 1963). Finally, Hartup (1961) reported that when 
either preschool or elementary school children are praised by their peers. 
performance is better maintained when the reinforcing peer is a disliked 
or unpopular child than when he is liked or popular. 


Social Deprivation 


Several studies have been concerned with the effects of brief periods of 
minimal social contact on children's responsiveness to praise and approval. 
For example, Gewirtz and Baer (1958b) found that praise produced 
greater changes in the performance of preschool children on simple tasks 
when preceded by a 20-minute period in which children were left alone 
in a laboratory room than when preceded by no specific waiting period. 
Quite similar results were obtained in a second study by Gewirtz and Baer 
(1958a), which involved first grade children. Praise and approval had the 
most marked effect when used after 20 minutes of social isolation, next 
greatest effect when used after no specific waiting period, and the least 
effect when used following 20 minutes of comfortable interaction between 
the child and the experimenter. Other findings, generally consistent with 
these results, were reported by Hartup (1958). Stevenson and Odom 
(1962), and Erickson (1962). i = 

The mechanisms underlying these experimental results have not yet been 
made explicit. Gewirtz and Baer argued that a period of brief isolation 
unl a social drive analogous to the hunger drive produced by food 
qe pim a e e der spat fr die ot 

s A n enhanced the performance of sixth 
grade children when praise was used as the reward but did not affect 
performance when a marble was used as the A Walters and Ray 
(1960) suggested that social isolation produces its affects on performance 
through arousal of anxiety, and Hill and Stevenson (1964) discussed the 
possibility that isolation motivates the subject as a consequence of general 
stimulus deprivation. At the moment, the fact that xüntmal social contact 
enhances socially reinforced performance is much better established than 
is the nature of the psychological processes eliciting this effect. 


Personality and Behavioral Correlates 


Investigation of children's personality and behavioral correlates of re- 
sponsiveness to social reinforcement has not been extensive. Perhaps the 
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best-substantiated finding is that the effects of praise and approval are 
somewhat greater for children evidencing strong. as opposed to weak 
dependency needs (Endsley and Hartup. 1960). There is also a dearth af 
knowledge concerning child rearing antecedents of responsiveness to social 
reinforcement. Some indication exists. however, that children whose par- 
and approval are less responsive to social rein- 


ks than are children whose parents are 
sez-faire in their approach to child 


ents frequently use praise 
forcement in simple laboratory tas 
somewhat less effusive or more lais 
rearing. 

Research interest in social-reinforcement effects on child behavior re- 
mains intense. During the past six years. there has been a remarkable 
concerning the effects of praise, approval. and 
Much remains to be done. The generality 
to be tested in terms of performance 
Also. further research is needed con- 
ognitive consequences of social de- 


increase in knowledge 
criticism on children’s behavior. 
of the results already obtained needs 
on other tasks and in life situations. 
cerning the basic motivational and c 
privation, social disapproval. and the like. 


Imitation 


The behavior of social models functions importantly in transmitting 
both deviant and socially approved patterns of behavior to young children. 
Bandura and Walters (1963) cited three possible effects of exposure to a 
model: (a) a modeling effect. whereby precisely imitative response pat- 
terns not previously present in the observers repertory are transmitted, 
(b) inhibitory or disinhibitory effects. whereby the frequency or intensity 
of previously learned response patterns is altered by observation of a 
model, and (c) elicitation effects, whereby observer responses are evoked 
which are not precisely imitative but are similar to the responses of the 
model. Recent research on imitation has been directed toward discovering 
both types of influence that models have on children's behavior and the 


determinants of imitation in children. 


Modeling Influences 


r and Dollard (1941) and more recent work 


David (1959) clearly established that the be- 
oung children with relevant cues in problem- 
€ the child of a model’s 


le. observation by 3 
hild observes the model being rewarded, 


ce discrimination problems. Research by 


Bandura and his colleagues has shown that models also exert — ee 
influences in the transmission of large segments of socially significant 
behavior. For instance. Bandura. Ross. and Ross (1961) exposed a group 
of nursery school children to adult models who used toys and other objects 
in an aggressive manner. As compared to another group of children who 
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responses, particularly when the c 
facilitates the solution of two-choi 


———— si‘ 


Review or EDUCATIONAL RESEARCH Volume XXXV, No. 2 


essive behavior. the experi- 
and 


observed models displaying inhibited. nonags 
mental group produced significantly more aggre 
equipment in a posttest. The results of another study showed that exposure 
to human models on movie film and observation of cartoon characters both 
were as effective as living models in transmitting aggressive patterns of 
behavior to young children (Bandura. Ro and Ross. 1963b). Some evi- 
dence exists that children's moral judgments can be altered as a conse- 
quence of observing models (Bandura and McDonald. 1963). On the other 
hand, little laboratory research has been conducted concerning the role 
such as 


ssive play with toys 


of imitation in transmitting other classes of social respons 
dependency. sex-typed behavior. anxiety. and achievement behavior. 


Response Consequences to the Model 


The extent to which the behavior of a model will influence the observer 
is at least partly contingent on the response consequences to the model. 
Bandura, Ross, and Ross (1963c) compared the amount of imitative ag 
gression elicited in four groups of preschool children: one which observed 
an aggressive model who was punished for his aggression, one which 
observed an aggressive model rewarded, one which had no exposure to 
models, and one which observed highly expressive but nonaggressive mod- 
els. The subjects who saw the aggressive model rewarded produced more 
aggression in a posttest than did subjects exposed to any of the other con- 
ditions. Those children who observed the model being punished for ag- 
gressive behavior did not differ from the control groups in amount 0 
prs ie on the posttest. Other studies indicate that models who 

rticularly isi i à 
m pide eed ee nad ru - paired dern 
1962; Bandura, Ross, and Ross, 1963a) p. 

, ana, , a). 

The studies by Bandura and his associates indicate that response con- 
sequences to the model influence a child’s performance of an imitative 
response rather than his acquisition of the response. Children who observed 
a model being punished for aggressive behavior showed relatively little 
imitative aggression when tested immediately after observing the model 
and, E d the experimenter that they disliked and dis 
approved of the model's behavior. When offer active i ives to 
reproduce the model's behavior, however. these eo iy dle very 
convincingly that they were able to do so. Thus, watching the model being 
punished did not increase the amount of spontaneous pesce shown 
by the children, even though they had obviously learned the response 
patterns of the model during their observation. ` 


Antecedents of Imitation 


New — a te child rearing antecedents of imitation has been 
y influenced by psychoanalytic theory that suggests that, in some 
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instances, the child is motivated to imitate. to identify with, the behavior 
of nurturant persons on whom the child is dependent. Psychoanalytic 
writers have postulated that in other instances identification occurs as a 
result of fear of punishment. 

Several investigations have been designed to discover whether nurturant 
interaction between an adult and a child predisposes the child to imitate 
the adult. In a study by Bandura and Huston (1961). a female model 
behaved “nurturantly” to preschool children in one group and “coolly” 
toward subjects in a second group. Then the model exhibited a variety of 
novel motor and verbal behaviors and demonstrated the correct solution 
to a simple discrimination problem. The novel motor and verbal actions of 


the model were imitated more frequently by the nurtured children than by 
On the other hand. nurturance by the model 


the non-nurtured children. 1 à 
imitation of the model's problem-solving behavior. Rosen- 


did not enhance : a 1 
blith (1959) investigated the effects of attentiveness and withdrawal of 
attentiveness on imitation by kindergarten children. Attentiveness seemed 

à as confined primarily to girls. The 


to enhance imitation. but the effect wy hild n 
relevance of nurturance to imitation Jn the child was 


Sears (1957). 
Ample evidence sugg 


discussed also by 


ts that aggressive. punitive parents tend to poe 
aggressive children. This evidence indirectly supports the [ee gie 
imitation in children is based on fear of punishment. n n 

i ss. and Ross. 1963c) indicate that 
hand, results cited above (Bandura, Ross. a pe nm db 
young children readily imitated aggressive be rue te gh m 
themselves were not threatened by the model. It mi areis € 
is motivated by anxiety in some instances, but cum raita 
be done to clarify the role of punishment as a ce ermina 


behavior. ests th 
gests 


One other theory of identification sug i 
i environment W 


at particularly powerful 
re pri ; sources 
ild’ ho are primary § 
models (i.e., persons in the child’s konman mu 2) d. dor 
of both rewards and punishments) will be on Rh nar 
example, Mussen and Distler (1959) found m = m a E 
ii i Y i cation. 2 
evidenced a high degree of masculine role pacis : rens sedie 
: l : ) 
less strongly identified boys. depicted po € eins Lo Sim 
i B ^ rewards dà s : 
relatively powerful source of both rewa errr 


rsery schoo 
dura, Ross. and Ross (1963a) reported peas d rer suum du iey 
to imitate adult models who possessed rewarding p 


i i desired rewards 
imitated (a) models who competed with the child for ci oe 
podes ; ither ontrolled reinforcement nor competed 10 
yw section of this chapter clearly indicate 


sses in the socialization of the child. 
ative behavior in children are varied 
and theory have been rapid. 
ag imitative phe- 


or (b) models who ne i 
The studies reviewed in the final 


ag Duc ipie 
the significance of imitative proc 


e determinants of imit 
in research 


i iclions nir 
ise predictions concert 
mediately ahead. 


Obviously, th 
and complex. Developments 
however, and increasingly p 
nomena should be possible ir 


——————— 


rec ; 
ı the years im 
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CHAPTER IV 


Teacher-Pupil Interaction 


EDMUND AMIDON and ANITA SIMON 


Since the early 1950's. a number of researchers have focused their 
attention on teacher-pupil interaction in the classroom. Bales (1950), an 
early observer of small-group interaction. described interaction as resulting 
when two or more persons behave overtly toward one another so that each 
receives some impression or perception of the other distinct enough to 
incur reaction. The studies reviewed in this chapter utilized observational 
data to measure the overt behavior of pupils and teachers as they interact. 
This chapter will present a brief de 


cription of some systems currently used 
to collect and categorize observational data involving teacher-pupil inter- 
action and then will summarize research related to the following areas: 
teaching patterns: achievement: climate. perception. and personality: and 
teacher education. 


Systems for Observing Teacher-Pupil Interaction 


Some of the systems that attempt to cat 
are concerned primarily with intellectual 
focus upon social-emotional behavior. and others are multidimensional. 
The systems described below are grouped in these three divisions: CO 
nitive, affective. and multidimensional. 

These systems—those briefly de: 


egorize teacher-pupil interaction 
activity in the classroom, some 


scribed and those listed as additional 
references—are the major systems currently in use to categorize teacher- 
pupil behavior directly or from taped classroom conversation, Some of 
these systems have been used only in secondary school classrooms. 


Cognitive Systems 


The system developed by Bellack and 


others (1963) contains three 
major categories: (a) pedagogic 


al moves. which are classified as struc- 
turing. soliciting. responding. and reacting: 
contains subcategories of substantive. 
instructional-log 


(b) content analysis. ier 
substantive-logical. instructional. ant 
‘al meanings: and (c) emotional meanings. defined in 


terms of valence. potency. and activity, Gallagher and Aschner (1963) and 
Aschner (1963) used a 


system containing four major categories for 
s: cognitive memory. convergent thinking- 
divergent thinking. and evaluative thinking. An additional category 15 
called routine. Taba and others (1961) developed a system that has two 
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basic categories: One deals with the pedagogical function of a thought 
unit, which can be either psychological or strategic: the other concerns 
levels of thought. which include grouping and labeling. interpreting and 


making inferences, and predicting consequences. Each of these has sub- 


headings. 


Affective Systems 


Withall (1949) developed a system which contains seven categories for 
teacher statements: learner-supportive. acceptant. problem-structuring. 
neutral, direction, reproving. and self-supporting. Flanders (Amidon and 
Flanders, 1963) developed a system containing seven categories for teacher 
talk: accepts feelings. praises. accepts ideas. questions. lectures, gives direc- 


tion. and criticizes. In addition. it contains two categories for pupil talk: 


student talk-responding and student talk-initiation. Another category notes 
vstem developed by Hughes (1963) divided teacher 


silence or confusion. A s 33 It 
behavior into these seven functions: controlling. imposition. facilitating. 
developing content. response. positive affectivity. and negative affectivity. 


Multidimensional Systems 


The Observation Schedule and Record (OScAR) developed by Medley 
and Mitzel (1958, 1963) is a comprehensive system for cataloguing teacher- 
pupil interaction, class structure, and classroom activities and materials. 
The three major dimensions of th al climate. verbal 
emphasis, à social organization. " . 

picos eei Kohn (1963) : Oliver and Shaver (1962) ; Smith 
and others (1962) ; Wright (1960). 


e system are emotion 


Teaching Patterns 


e description of how 


creativ 
“classroom 


Bellack and others (1963) presented f he rules of the 
" H ^| 7 Tr! S 

teachers and students interact, characterizing np her will be the 
game." For example, a rule for teachers 75 yl ee py ils is "the 
$ H , ^ $ rule for pupils 1s 
single most active person playing the game dido s teacher's solicita 
pupil's primary task in the game is to Ip ak - system, found in 
tions.” Bellack and his associates. usmg their category system, 


ir study of 15 high scho i dies teachers hat the two most 
tl g hool social stu that t t 
; : m discourse, making up nearly 50 percent of 


her solicitation—student response and (b) 
nse—teacher reaction. Although the 
l. it is relevant for elementary schools. 
and the classification of verbal 


common patterns of classroo 
the interaction, were (a) teac 
teacher solicitation—student respo 
study was done on the secondary leve 


Their system is extremely sophisticated, d classify a ty vescript made 

interaction requires that observers study and class? y, H E Shed 

from a tape recording Classifying emotional qualities from a printe 
a e e P, 
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typescript is difficult because there is little opportunity to take ons 
the effect of intonation and other cues on the affective climate of the c " 
room. Therefore, in order to classify emotional tone. Bellack — 
necessary to play the tape recordings collected in the study to a separa 
group of high school students whose reaction was then used to rate 

emotional tone of the tape-recorded classes. E , 

Giammatteo (1963), using Flanders’ system, studied 150 elementary 
school teachers during language arts periods. He found that teachers ac- 
counted for more than 50 percent of the talk in the classroom and that most 
student talk was in response to teachers’ questions. Although a large num- 
ber of teachers were included, each teacher was observed for only 45 
minutes. Using supervisory ratings as a criterion of teaching excellence, 
Amidon and Giammatteo (1965) compared the interaction patterns of 
30 “superior” teachers with 150 randomly selected teachers in 11 ele- 
mentary school districts. Using Flanders’ categories, they found that su- 
perior teachers talked less, accepted more student ideas, encouraged more 
pupil-initiated participation, and gave fewer directions than did average 
teachers. The lack of reliability of supervisory ratings calls into question 
their criteria for superior teaching. ] 

Using her own system of categorization, Hughes (1959) found in a 
study of 41 elementary school teachers that the median proportion of 
"controlling" acts was 46 percent. She also concluded that over 80 percent 
of the teachers studied were dominative in over half of the acts they per- 
formed. Control was the major function used, and simple memory-recall 
was the most common mental activity solicited by teachers. The teachers 
evaluated their students positively more often than negatively, but gave 
students criteria for evaluations less than half of the time. There appears 
to be opportunity for error in Hughes's study because the observers at- 
tempted to write down everything that happened in the classroom and to 
classify this later, using typescripts made from their notes. While Hughes 
stated that her intention was to describe teacher functions, she appeare 
to have built into her description “good” and “bad” values. Categories 
such as controlling, imposing, and negative affectivity, for instance, were 
discussed as if they were bad. 

Much of the work on teacher-pupil interaction patterns has been done 
by Flanders (1960) and his associates (Amidon and Flanders, 1963: 
Flanders and Amidon, 1962), Although the major portion of the data was 
collected in the seventh and eighth grades, the results are relevant for ele- 
mentary schools and are consistent with those reported by Giammatteo. 
Hughes, and Bellack. Perhaps these results are best summarized by 
Flanders' rule of two-thirds: in the average classroom someone is talking 
two-thirds of the time; two-thirds of this is teacher talk; and two-thirds of 
teacher talk consists of direct influence (lecture, direction giving, or criti- 
cism). Perhaps the greatest limitation found in research using Flanders 
system was the lack of emphasis upon specific cognitive aspects of class- 
room interaction. 
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The studies reviewed in this section are significant because they describe 
what Bellack referred to as “the rules of the game.” Apparently teachers 
and pupils follow common patterns in their classrooms which can be de- 
scribed and classified and which seem to indicate generally that classrooms 
are predominantly controlled by dominative teacher behavior. 

Additional References: Brown (1961): Miller (1964). 


Achievement 


Amidon and Flanders (1961), using the Flanders system, conducted a 
study in which 54 eighth grade classes were taught geometry in a two- 
acher using either direct influence (lecture, direction 
rect influence (accept student ideas and feel- 
They found that dependent-prone students 
m indirect teaching. In a more extensive 


study, Flanders (1960) found that seventh and eighth grade students in 
social studies and geometry learned significantly more when their teachers 
were indirect rather than direct. Above-average student participation, 
above-average acceptance of student ideas, and below-average teacher talk 
were associated with higher achievement. Schantz (1963), using Flanders 
categories, tested the effect of indirect and direct teaching on high and 


low ability children in fourth grade science lessons. She found that the 
t teacher influence scored significantly 


high ability group exposed to indirec u i n 
hiher vnde nr ste mi at the end of the experiment than did the high 
ability group exposed to direct teacher iien, by Bush (4984) which 

Spaulding (1963) tested a hypothesis sugges : di bove-average 
theorized that content-oriented teachers would Lun E is d aulding 
achievement in reading and arithmetic. Using kis ar not agnos the 
observed 21 elementary school teachers, but his data di pede poe 
Bush hypothesis. Taba (1961) trained o o ped v ur own category 
level of thought processes used by their pupils. aee eene on the same 
System, she found that some children of low IQ $ : ticipation as did 
high level of abstraction with the same foamy c - la strategies. 
some children of high IQ when their teachers use! poet a uie appro- 
Taba also found that children whose -- "dor beers inferences. She 
priate strategies gained significantly in ps ae 3 sed by pupils and any 
reported low correlations between level of though ps ability to abstract: 
of the variables traditionally thought to sone p» social 
intelligence. reading comprehension, achievement 1 


] studies, and 

! i hner 

y ges of 12 intellectually superior classes, Gallagher and is 

(1963) aienti to relate the type of nce qui m 
$ MON the part of pupils. FIV utiv ssro 

of divergent thinking an. g Using her categories for classifying thinking, 

donet (ound, bnt hen the percentage of divergent questions 


Aschner found, in general, that w 


hour period by a te 
giving, or criticism) or indi 
ings, praise, or question). 
learned significantly more fro 
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from the teacher was high. the percentage of divergent thinking production 
from children was also high. Conversely. when the percentage of divergent 
questions was low, divergent thinking produced by the children was also 
low. Boys tended to do more divergent thinking than girls. 

Additional References: Christensen (1960) : Gordon and Adler (1963) ; 
Heil and Washburne (1962): Mastin (1963): Medley and Mitzel (1959). 


Climate, Perception, and Personality 


Morse, Bloom, and Dunn (1961) attempted to determine the relation- 
ships among pupil perception, teacher perception, and the perception of 
trained outside observers of four dimensions of classroom activity: devel- 
opment. mental health. group processes. and achievement. The observers 
used a rating scale developed by Morse to describe interaction in 30 
classrooms in grades 5, 8, and 9. In general. if pupils rated one dimension 
high, they tended to rate all dimensions high: if they rated one dimension 
low, they rated all low. This was likewise true for the trained observers: 
who also saw the fours areas as compatible. On the other hand, teachers 
tended to rate the dimensions on an either-or basis. If teachers rated their 
classes high in achievement, they tended to rate them low in the other 
three dimensions. The authors also found that observers and pupils tendec 
to agree on their ratings of the achievement dimension, while the teachers 
perceptions correlated negatively with both. 

Ryans (1960), in a study involving a large number of elementary and 
secondary schools, found that his three dimensions of teacher behavior 
(friendly versus aloof, businesslike versus slipshod, and stimulating versus 
dull) yielded high intercorrelations in elementary school teaching but low 
intercorrelations in secondary school teaching. Ryans’ observers used rat- 
ing scales that were similar to those used by Morse. His findings in the 
elementary school tended to support those of Morse, that an observer Pe 
ceives certain teaching dimensions as interrelated: however, his secondary 
school findings did not support those of Morse. Flanders (1959). in ? 
series of studies spanning 10 years. attempted to relate pupil attitudes as 
measured by a climate index to teacher influence patterns as identified by 
trained observers using the Flanders system. The studies, carried out in 
elementary and secondary schools in the United States and New Zealand. 
indicated that a positive social-emotional climate tended to be associate 
with indirect teacher influence. 

Travers and others (1961). utilizing two large samples of elementary 
school teachers. attempted to relate teacher needs to teacher behavior- The 
four dimensions of needs (achievement. affiliation. recognition, and co™ 
trol), as identified by a projective instrument. appeared to be unrelate 
to the overt behavior of teachers as recorded by observers using the 
Withall categories. However, teachers’ “need” to control, as measured by 
a paper-and-pencil test, the Teacher Preference Schedule, was significantly 
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related to teacher controlling behavior in the classroom. Bowers and Soar 
(1961), using the Minnesota Multiphasic Personality Inventory (MMPI) 
as a measure of teacher personality. attempted to relate patterns of teacher 
personality to patterns of observed teacher behavior as indicated by the 
OScAR. They found the highest relationships between the psychopathic 
deviate (Pd). schizophrenic (Sc) and hysteria (Hy) scales of the MMPI 
and the OScAR dimensions of emotional climate, verbal emphasis, and 
social structure. Bowers and Soar concluded that teachers who (a) lacked 
maturity, depth of affect, and ability to feel personal loyalties (high Pd) 
and (b) were constrained, cold. remote. and inaccessible (high Sc) were 
less likely to have supportive emotional climates, extensive pupil-pupil 
interaction. or any great amount of pupil group work in their classrooms. 
Medley (1961). in a study in which he used the MMPI, found that teachers 
who had positive pupil-teacher rapport as indicated by high Minnesota 
Teacher Attitude Inventory (MTAL) scores tended to score high on the 
Hy and Pd scales. while low MTAI scorers were highest most frequently 
on the hypochondriasis (Hs) and depression (D) scales. There appears 
to be no consistent pattern of relationships in these three studies between 
the variables of teacher personality and teacher behavior. Lewin’s (1951) 
theoretical analysis of the relationship between personality and behavior 
would lead one to expect a rather predictable relationship between these 
two variables, A theoretical analysis combined with an empirical investi- 
gation may be necessary to account for apparent contradictions. " 
Additional References: Bowers and Soar (1962); Gnagey (1260) ; Keis- 
lar and McNeil (1959); Kounin, Gump. and Ryans (1961); Wallen and 


Wodtke (1963). 


Teacher Education 


pointed out that the variable of 
and identified by a trained observer. 
of elementary school student 
arly related to the overt 
f the classroom. whic 


Schueler, Gold, and Mitzel (1962) 
emotional climate could be categorized 
Using the OScAR, they observed three groups 
teachers and found that the variable most cle 


teaching behavior of student teachers was that o Dee end coopetitit 
apparently is made up of components of classroom cu 


i ident 
teacher. They found that the variable of college superi e. Vi 
teachers had little identifiable effect on student e em vss niis, such 
and Edson (1962, 1963), in an attempt to relate pre fom and counselors” 
as scores on the MT AI, MMPI, and Miller Analogies Test, 


ients based on interview with 30 S ive ele ntary ducation 
j V pr ctive Jementary € 

: I s I prospe 

1 between high MT 'AI scores and integra- 


students, found high relationships Flanders system and the OScAR. 
tive teaching behavior as measure by n ai me MTAI scores and domi- 
They did not find a high relationship between c brand diat eouniselors 
ORES smdenr teacher wee ^w jd ed integrative student 
" : Mv «ho later €* usb 

were niot able to: identify purent = interview and admissions data. 
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Kirk (1964) found that the teaching patterns of elementary school 
student teachers who had been taught the Flanders system could be 
changed. These student teachers encouraged more pupil-initiated talk. re- 
duced the amount of patterning involving teacher question and pupil re- 
sponse, and resisted the normal trend to increase the number of directions 
given to a significantly greater extent than did student teachers not taught 
the system. Kirk concluded that this difference was accounted for by the 
objective feedback which the Flanders system provided. Gage (1963) 
studied the effect of feedback on teachers and indicated that feedback may 
be the crucial element in an in-service training program designed to 
change teachers' classroom behavior. Taba and others (1961) suggested 
that as teachers begin to receive feedback about their teaching strategies 
they will have the opportunity to modify their teaching habits. 

Additional References: Flanders (1962); Flanders and others (1963) : 
Storlie (1962) ; Waimon (1962). 


Conclusions and Needed Research 


Within school classrooms there appeared to be definite patterns of 
teacher-pupil interaction which could be objectively observed and cate 
gorized. These patterns were apparently related to achievement, perception: 
and classroom climate. While there also appeared to be a relationship 
between teacher personality and teacher-pupil interaction patterns. there 
seemed to be uncertainties about the exact nature of this relationship. 
Perhaps additional theoretically oriented research on teacher behavior an 
teacher personality would help to clarify some of the relationships betwee? 
teacher characteristics and teaching outcomes. 

The application of teacher-pupil interaction research in teacher educa- 
tion programs appears to hold great promise for the improvement of edu- 
cation. The feedback provided by these observational systems seems d 
have significant influence on the behavior and attitudes of teachers. How- 
ever, much additional research is needed to determine the ultimate place 
of observational systems in the field of teacher education. 
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CHAPTER V 


Education for Disadvantaged Groups 


CYNTHIA P. DEUTSCH * 


In recent years, considerable attention has been focused on the stimula: 
tion and development of cognitive processes. Within this area, a o 
portant subarea has been evaluation of the varying etiologies of defectiv! 
intellectual performance. The work done here has led to a much piste 
sophisticated view of mental retardation and of poor academic paruo, 
ance than that which previously existed. There have been many studies o 
the learning process and of concept formation in retardates, aiming z 
define relevant variables. Observed results from these studies are relatec 
to cohesive bodies of theory. such as Piagets. rather than left to languish 
in the specialized literature of mental retardation. The over-all effect g: 
this work has been to bring the area of mental retardation much deeper 
into the educational and psychological mainstream. 

Review of the literature for the past period supports a generalization me 
the emphasis has shifted from the classification of mental deficiency an4 
a concern with institutional organization and community services to a 
greater concern with interventional and educational programs in order to 
minimize the effects of retardation. More and more. the retarded are being 
studied in terms of the abilities and skills that they have and that can be 
developed, rather than in terms of those that they apparently lack. Both 
these recent emphases—on cognitive processes on the one hand and on 
present rather than absent skills on the other—have led to ereater interest 
in educational programming for the retarded. Recent literature: both 
research and theoretical, reflects these emphases. P 

Foremost among the etiologies of retardation receiving greater emphasis 
is that of environmental deprivation. An early article by Benton (1956) 
deserves mention here as being one of the first in the recent period to 
present a cohesive approach to environmental determinants in retardation. 
Though. the term Benton used, pseudofeeblemindedness, has fallen into 
disuse, and though his emphasis was still on the interaction of emotional 
and cultural status with what he termed the “basic hereditary factor. 
the approach exemplified in that article can be considered the transitional 
one. Following that time. and as a result of greater scientific and socia 
concerns with the portion of our population labeled disadvantaged or de- 
prived, there has been a growing interest in evaluating the effects © 
environmental conditions on mental functioning. The emphasis here has 


" -— lair Falk for 
* The author would like to express appreciation to Ruth Mechaneck and to Clai 
their bibliographie assistance in the preparation of this chapter. 
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been both in terms of identifying the socially deprived and retarded (e.g.. 
Jensen, 1961) and in terms of a more global evaluation of the effects on 
intellectual performance of disadvantageous environments, (e.g.. John, 
1963: Deutsch, 1963). leading to attempts to reverse the effects of environ- 
These active and proposed intervention programs 
directed toward improving the active 
en and adults whose retarda- 


mental deprivation. 
parallel the educational attempts 
skills and awakening the latent skills in childr 
tion stems from neurological inadequacy. 
With the increase in the range of specialties concerned with problems 
of mental deficiency, there has been a corresponding increase in the num- 
ber of books and articles published. The literature in mental retardation 
is voluminous and disparate. As a result, no review of it is included here. 
Instead, only a few of the current books are mentioned; the literature on 
educational programs for the socially disadvantaged is covered more ex- 


tensively, though by no means exhaustively. 


The Mentally Retarded 


The monumental work of the past six years in the field of mental re- 
tardation, and the book that probably has most influenced the development 
of the field, is Mental Subnormality Þy Masland, Sarason, and Gladwin 
(1958). The authors reported a survey of the field of mental subnormality 
and divided the book into two major parts: the section by Masland dealt 
with research into biological factors in retardation; the oihan a pannon 
; i ural research. 

and Gladwin, treated environmental and cultural research i ded 
did not regard biological and environmental factors as ae E theie 
but as separate yet interacting etiologies. They also he inea disint eR 
attack on what they term a shibboleth: the idea of a eret ena idiota 
the basis for all mental subnormality" (p. Lauer mus. 
educational programs; they assumed that retardates p die edited by 

Among more recent books is Handbook of Mental Se ontinsing 
Ellis (1963). The distribution of chapters reflects ana aae intelli 
concerns in the mental retardation field (e.g-. perenne POS tes de. 
gence, and brain damage); also included are some ker je edits 
partures (e.g., the application of Piaget’s theory ro tn efforts 
deficiency, research on activity level). gs ym d = to educational pro- 
of social deprivation and contributions of psye 1961) resented a broa 
gramming. In Mental Retardation, Rothstein a 3 b discussed. educa- 
range of chapters, in many of which are descri 2 red. Educating Ex. 
tional programs and techniques for the mentally yar articularly relevant 
ceptional Children, by Kirk (1962), a book ts Lora exceptional 
includes a survey of educational practices vo ien : be a aes: 
children, including the gifted, the physically andicappee: 

F s the retarded. 


tionally disturbed as well a 
ona y 1stur. "n 


O O .!'é:t‘ 
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The Socially Disadvantaged 


The concern with the socially disadvantaged as a specific eg 
grouping and as a specific educational problem is relatively secent. a 
interest has been fostered by the increasing urbanization of our eee in 
tion and by the awareness of stratification and differential privilege arci 
our society. Henry (1963) pointed out that the middle class home contair : 
a “hidden curriculum" which enables the child to deal appropriately m 
his first school experience. The disadvantaged home contains no such s 
riculum and, as pointed out by Deutsch (1961a). represents a discontinut y 
with the school environment. Since the urban slum school also is it 
crowded and experiences greater turnover of both pupils and personne 
than does the middle class neighborhood school. researchers in the aren 
have been led to deal with school organizations as well as with the learning 
and response of the individual child. Further. since urban slum IBN 
include a high percentage of people from minority group backgrounds: 
desegregation and integration have become intertwined in the considera- 
tions of educational programs for the socially disadvantaged. oe 

Two general books that have contributed a great deal to the considera- 
tions of educating the socially disadvantaged are Intelligence and Exper 
ence by Hunt (1961) and The Process of Education by Bruner (1960). 
Hunt explored a great number of tacit assumptions about the immutability 
of intelligence. With a thorough review of Piaget's concepts buttressed b) 
implications from animal studies and a review of pertinent. child develop- 
ment research, Hunt attempted to establish the prime influence of exper! 
ence in the development of intelligence. This concept of the influence es 
experience underlies the institution of special educational programs = 
children from the slums, who are subjected to experiential deprivation 
varying degrees. Bruner supported special programs by emphasizing =“ 
"teachability" of almost all knowledge, if the correct approach is e 
Bloom (1961) further emphasized the mutability of intellective as We 
as other characteristics of the child. ; 

Education in Depressed Areas, edited by Passow (1963), deals directly 
with many different aspects of the problem of the education of the dis- 
advantaged child. Chapters by different authors are divided into sections 
on “Schools in Depressed Areas,” “Psychological Aspects of Education in 
Depressed Areas,” “Sociological Aspects of Education in Depressed Ateas 
“Teachers , for Depressed Areas,” and “School Programs in Depressec 
Areas." The book includes a substantial bibliography. : 

The emphasis in The School Dropout, edited by Schreiber (1961). is 0? 
the children that the school loses. This book. which is a collection g 
articles by different writers, is divided into “Societal Factors” and “Tmpli- 
cations for School Programs.” . 

Questions concerning integration are discussed in Integrating the Urban 
School, edited by Klopf and Laster (1963). While the papers included in 
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the book refer specifically to the problems in the New York City schools, 
the issues discussed are probably generalizable to any urban community. - 

The implication of the material in the books discussed so far is that, 
with proper educational planning. the deficiencies of the children from 
disadvantaged circumstances can be overcome. The emphasis has been on 
early compensation for the deficiencies in home environment. Riessman 
(1962) argued, however, that the disadvantaged are poorly prepared in a 
relative sense only and that educational programs can be begun at any 
age. Such programs, according to Riessman, should start with the real 
strengths that disadvantaged children possess in nonverbal communication 
and independence and should not rely on the middle class model. How- 
ever, Riessman’s argument on cultural relativism does not take into con- 
sideration the increasing industrialization of our society and the consequent 
greater dependence on verbal communication which leads to a greater need 
for facility in verbal skills such as reading and writing. 


Social Conditions to 


Relationship of 
Development 


Intellectual and Language 


“consistent class differences in language skills 
n groups of Negro children of different socio- 
hildren—children from more advantaged 
etter command of language in 
functions. The educational 
hasis on language teaching 


John (1963) reported that 
were shown to emerge betwee 
economic class." Middle class c 
environments—were reported to have a b 
terms of its classificatory and problem-solving 


implication of these findings is for greater empl > 
for disadvantaged children, an implication supported in a study by Jensen 


(1963a). In studying different verbal learning tasks he found that paired. 
associate learning involved greater reliance on verbal mediators than did 
serial learning. His approach offers the possibility ofa functional method 
of differentiating the socially disadvantaged from those children who are 
retarded as a result of other etiological factors. According to John (1963) 
and Bernstein (1960, 1962), the socially disadvantaged were less facile in 
the use of verbal mediators. The above studies open the problem of verbal 
mediation to both the diagnostic problems in low IQ findings and b 
educational problems in attempting to reverse the effects of cp o 
(linguistic) deprivation. The work of Kendler (1963), among others, is 


pertinent here. n — 
Intelligence tests are known to 2€ subject to ling 
n , in (1964 ted 
vironmentally determined factors. Pauna gad € I dias Do 
to d i s s of environment which contribute t g 3 
IR ey n divided into groups on 


In a sample of 281 first and fifth grade children di 
the basis ‘of race and social class. they found that children from fatherless 


homes scored lower on the Lorge-Thorndike IQ test than did children 


from homes in which the father was present. Children ats had no 
preschool experience also scored lower than did children with preschool 
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experience. The data they reported are the first from an extensive study 
of intellective and verbal factors related to socioeconomic status, race. and 
other individual and demographic factors. When the remainder of the 
study is published. researchers should have considerably more information 
and, simultaneously, considerably more problems to investigate. 

Klineberg (1963) reviewed some old and some recent studies of the 
relationship between environment and intelligence and came to a conclusion 
which is in agreement with that of Deutsch and Brown (1961). namely, 
that there is a “cumulative deficit” associated with disadvantaged environ- 
ments. As so many other studies have shown, children who are exposed 
to disadvantageous environments are subject to decline in intelligence 
score. According to Deutsch and Brown, this decline is evident by the 
fifth grade of school. 


Additional References: Goss (1961); Luria and Yudovitch (1959): 
Vygotsky (1962). 


Educational Programming 


The major emphasis in educational programming for disadvantaged 

children has been to initiate programs before the gap between these chil- 
dren and their more advantaged peers widens beyond reach. This emphasis 
has led to development of programs on the nursery school level. Nursery 
programs that are oriented toward the stimulation of cognitive development 
find general support in Fowler (1962) and in fenseri (1963b). Their 
psychological basis was specifically elaborated by Hunt (1964) and their 
usefulness explicitly posited by Deutsch (1964b). The emphases in both 
articles were (a) that the preschool years are important in building the 
basic skills which underlie cognitive development and (b) that the assump- 
tions which have guided the use of these years for noncognitive play 87€ 
themselves invalid and are particularly inappropriate for the education 9 
children subjected to deprived circumstances. 
. In general support of the same conclusion were papers which dealt with 
inadequate stimulation of perceptual skills (Deutsch. 1964) and with 
inadequate linguistic skills (John and Goldstein, 1964). Both papers indi- 
cated that slum homes are deficient in the stimulation required for ade- 
quate development of discrimination and of language skills. The two areas 
were bridged by a discussion of the relationship between auditory dis- 
crimination and linguistic and reading functions. 

It is too early to evaluate either the import of these theories and rec 
ommendations or the success of the programs which they have stimulated. 
The results of the preschool programs for retarded children discussed by 
Kirk (1958) and by Connor and Talbot (1964) would seem to indicate 
that the approach can be productive. Since a number of preschool pro* 
grams for disadvantaged children are now under way, it must remain for 
the next issue of the Review in this area to report and evaluate their 
success. 
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CHAPTER VI 


Reading Research and Instruction 


JOANNA P. WILLIAMS 


The present state of reading research differs greatly from that of 1958. 
Within this period there has been a renewal of interest in the theoretical 
analysis of the basic reading process. and recent contributions by psychol- 
ogists and linguists have been of great influence in focusing interest on 
basic studies. Considerable attention is also being given to the develop- 
ment of new and efficient techniques of instruction, notably within the 
context of programed instruction. Certainly, not all recent research has 
been centered around these developments. However, because these trends 
are new, influential, and promising, this chapter will focus on them. 
Other reviews have been prepared by Carroll (1964), Holmes and Singer 
(1964), Russell and Fea (1963), and Silberman (1963). 


The Nature of the Reading Process 


A divide-and-conquer approach to the problem of reading instruction 
has been adopted recently. If components of the whole task can be iso- 
lated, they may prove more susceptible to study and, hopefully, may lead 
to better methods of instruction. 

It is fairly generally agreed that the reading 
two distinct stages (Carroll, 1961). The child must 
symbols he sees on the printed page correspond in some way to his 
spoken language. The initial stage of reading consists of this decoding 


of orthography into already learned speech patterns. This ability is a 
necessary prerequisite to the second stage, in which the child actually 
comprehends the meaning of the message, since the meaning of the orthog- 
raphy is mediated by verbal responses, whether vocal or subvocal. Indeed. 
one might argue that the comprehension of av age involves the 


written mess 
same behaviors as does the comprehension of a spoken message and that ( 
this second stage is actually not a part of reading p 


er se. In any event. 
the task of decoding is seen as the primar 


process can be divided into 
first learn that the 


y task of the reader. 


Perceptual Learning 
t before decoding can take 


differentiate the written symbols, or graph- 


emes. that he finds on the printed page. On the basis of a developmental 
t certain features that differentiate among let- 


study, Gibson concluded tha 
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Gibson and others (1962a, b) suggested tha 


place the child must learn to 


BE 
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ters are also critical in object discrimination. The beginning reader ns 
already learned these and can simply transfer them to the reading r 7 
Other features are critical for differentiating between letters, but not «| 
differentiating objects (e.g.. rotations and reversals). and those are learned 
as children learn to read. n es of 

Gibson and others (1963) constructed a set of distinctive features o 
letters (e.g., whether the letter is symmetrical) such that each pu 
the alphabet can be described uniquely in terms of these pg ia 
predicted that the difficulty of discriminating any given pair of ves 
would depend on the number of distinctive features that the two let e 3 
shared. The relative dificulty of letter-pairs as reflected in the resulting 
confusion matrix was predicted quite well by her theoretical analys . 
Popp (1961) reported findings in substantial agreement with the analysis 
of Gibson and others. 

A recent experiment by Hendrickson and Muehl (1962) showed that 
pretraining that forced the child to attend to the directional iecit 
between b and d facilitated learning names for those letters in a Dl 
associates task. These data suggest that training to focus on critica 
features (at least the reversal feature) will indeed facilitate discrimination. 

One common recommendation of educators (e.g., Spalding and Spalding. 
1957) is that training include some tracing or writing; the extra kinesthetic 
feedback presumably facilitates learning. However, the above jientionen 
study by Hendrickson and Muehl indicated that children who were manes 
to make consistent motor responses to each letter did no better at learning 
to discriminate the letter than those who made inconsistent motor responses 
to each letter. Other data also indicated that, the addition of kinesthetic 
cues in training does not produce better performance (Otto, 1961; Ofman 
and Shaevitz, 1963). When some kind of kinesthetic training does make 2 
difference, the effect is usually small or limited. For example, Levin. 
Watson, and Feldman (1964) found that tracing graphemes improved per- 
formance on a paired-associates task. but only when the initial grapheme 
was traced. Usually, a kinesthetic method is effective only with remedia 


readers, perhaps because it is a different kind of training from that 17 
which they have already failed. 


8. 


Decoding 


The notion that the central task of reading involves learning the rela- 
tionships between orthography and speech has, of course, led to the idea 
that it is crucial to gear instruction to the development of these Lp panni 
phoneme correspondences. This task is not so straightforward as it mg 
seem: while English orthography is alphabetic in principle. there is ? 
great deal of inconsistency. at least when simple grapheme-phoneme cor 
respondences are considered. 1 
Recent teaching systems developed by linguists recommend not only 
that instruction stress correspondences but also that the beginning reade! 
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be introduced only to regular and consistent correspondences. Only when 
one sound is well established as a response to a grapheme should another 
sound be introduced. Bloomfield and Barnhart’s (1961) Let's Read presents 
the short sounds of all vowels first and does not introduce the long sounds 
until the short ones have been overlearned. Irregular spellings are classi- 
fied "according to the various types of deviation from the alphabetic prin- 
ciple" and are presented systematically. after the regular correspondences 
A similar approach was taken by Fries (1963), who 
nt of basic and meaningful contrasts between spell. 
ing patterns in whole words (e.g.. man versus mean versus mane). In fact, 
one of the most significant differences between these new linguistic methods 
and the older phonics methods is the extent to which the former stress 
regularity and consistency in the beginning of training. 


Another way of providing consistent. grapheme-phoneme correspond- 
been studied recently. The Initial Teaching 


ented Roman and now usually referred to 
bols, designed so that there 


are well learned. 
stressed the developme 


ences for beginning readers has 
Alphabet (formerly called Augm 


as ita.) is an artificial orthography of 43 sym 
is a one-to-one correspondence between sound and symbol. Preliminary 


data indicate that this modified orthography is a successful teaching 


medium. Downing (1961) reported that children who start with i.t.a, 
make significantly more rapid progress through a’ series of basal readers 
printed in i.t.a. than do pupils using ble materials in traditional 


compara 
orthography. The important question, of course, concerns reading achieve- 
ment in regular English after ita. training. In a comparison of seven 
matched pairs of schools about 18 months after reading instruction was 
begun, Downing found that pupils who first learned with i.t.a. read sig- 
nificantly more accurately and with more comprehension than children 
who started with regular written English. According to the teachers re 
ports, spelling, after the transfer to English, was "no worse than tenal, 

No mention was made, however, of the pupils’ skill at writing regu ar 
orthography after learning ita. Downing (1963) also reporte, = cmm 
a pilot study in which i.a. was used successfully in ae A ruction. 

An artificial orthography similar to ita. was developed by 


Sebesta 
(1964) and tested in a California first grade classroom. There hos i 
differences between children who learned regular orthography vm ihi 
beginning and those who started. with the artificial drea ted i | 
Gates Primary Reading Tests or on a test of familiar kom “ia Tan ad 
girls, trained on the artificial orthography scored SRR 2 

a test of unfamiliar words than did those trame 


the regular 
orthography. 
Levin and Watson ( 


d the assumption that only 


ed to be- 

regular grapheme-phoneme correspondences should j See S vfum 
ginning read They asked one group of third grade child l 

a a i h was associated with a 

a list of four words in which 2 single grapheme W: "WC à 

single phoneme (i.e.. the correspondences were consistent). es secon 

associated with two phonemes (ie. the corre- 


group, one grapheme was 
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1963) have challenge 
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spondences were variable). The latter group, while taking longer to learn 
the initial list, learned a second variable list faster. Levin suggested that a 
“set for diversity” is developed through concurrent training and that this 
leads to facilitation of later association to that graphic stimulus. This 
study is representative of the current trend toward small-scale laboratory- 
type studies. Such work permits tests of the effectiveness of different teach- 
ing methods under controlled conditions. 


The Critical Unit 


Is it indeed a good assumption that training should focus on the corre- 
spondences between letters and phonemes? It has been argued that sin 2 
the single letter does not have invariant acoustic match in English. jt can- 
not be the appropriate unit to consider (Gibson and others. 1962b). 
Rather, letter clusters have invariant relationships with phonemic patterns 
and form the critical correspondences between written and spoken lan- 
guage; these are the functional and critical units. Even without specific 
training, according to Gibson and her colleagues, and even if an individua 
cannot formulate rules about the correspondences. he discovers them as he 
learns to read. In line with these ideas, they found that after tachistoscopi¢ 
presentation, pseudowords that followed the rules of spelling-to-sound corre- 
lation were reproduced more accurately than those that did not. In another 
study, Gibson, Osser, and Pick (1963) found that children just starting tO 
read seemed to have assimilated certain simple rules; they could read tri- 
grams that fit such rules more accurately than trigrams that did not. 

An important attack on the question of the nature of these correspond- 
ence rules was made by Weir (1964). On the basis of an analysis 9 
spelling-to-sound relationships in 20,000 common words Weir suggeste 
that English spelling is related not only to phonology but alko to morphol- 
ogy and syntax. Indeed, when these relationships p taken into account. 
there is a considerably greater amount of regularity between written an 
spoken English than when only simple lester-tocenund relationships 87€ 
considered. Weir believes that the selection of eds io be üsed in teach- 
ing should include some consideration of grapheme-to-morphophoneme 
patterns. 

Lefevre (1964) was also concerned with the establishment of an appro 
priate unit to use in the development of instructional materials; but 3n * 
different way. In Lefevre's approach, the word itself is treated as à minor 
language unit. He believes that since words occur in groups within large" 
structural and meaning-bearing patterns, teaching “Teod stress these 
larger structural patterns. Until data concerning the effectiveness of this 
approach are collected, it will not be possible to judge the value of what 
would seem to require more complex learnings of a beginning reader than 
do most current programs. In any case, Lefevre has developed the first 
linguistic method that actually draws heavily on the content. of structura 
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linguisti s M restatement and reapp 
cs and does not sto 
8s stop vith a resta 
PI lication of the 


Comprehension 


While s x process. d there 

current emphasis is on the decoding process, it is clear that tl 

Is a need to consider comprehensi i ano ; 
sion in one form or anotl rithi 

e t 1 : nother within this 

context. Work to date indicates that the determination of the critical unit 


of corr ence wi si EI 
espond nce w ill depend on analysis of syntactic and emantic as 
) E a 


well as phonological structure. 
d ` 
ones and Carterette (1963) have found that when given free choice 


bois preier E that are less redundant (ie. more difficult) than 
ie E Eo m pis they are offered in school. It should be noted that 
ene i were usually books that the child would have read to him, 
i soe n iis own. Strickland (1962) found that oral language patterns 
on ie complex and flexible than patterns found in typical ele- 
nentary readers. This and other work suggest that perhaps the construc- 
tion of reading materials should be related more closely to the child's own 


language. 


Programed Teaching Systems 

he development of sight vocabulary, 
ord methods of instruction (e.g. 
d a phonics approach. Hope- 
based on some of the work 
integration of the two 


Early programs often focused on th 
reflecting the then-prevalent whole-w 
Stolurow, 1961). More recent program? use 
fully, the content of future programs will be 
previously cited; so far, there has been little 


approaches. 

A good example of th 
of an instructional program is see 
program designed to teach first gra 
combinations. An initial program W 
several pupils individually in a tutoria 
were developed on the basis of what seemed 
these sessions, and formal experimental comparison W 
the last revision and the original version of the program. T 
of the program were not as successful as were oral presentations by an 
experimenter. Silberman suggested that the experimenter may modify his 
pacing in response to subtle cues he gets from the subject. 

Given the current emphasis on individualized instruction and on pro- 
gramed instruction, one would expect to see a great deal of effort devoted 
to adapting programed instruction to individual differences. Little work 


has been done in the area, partly because of technical complexities (Sil- 
esearch concerned with individual 


berman, 1963). Most of the current T 
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e use of experimental an 
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ders to sound out 


F — ae 
Review or EDUCATIONAL RESEARCH I olume XXXI , No. 2 


differences was conducted in the more traditional classroom setting (e£ 
Balow, 1962; Grimes and Allinsmith. 1961: Preston. 1962). 


The Age for Beginning Training 


Many investigators using automated techniques and/or precisely pro- 
gramed training sequences have achieved success with children of an age 
below that at which instruction is typically begun (e.g., Staats and others. 
1964; Moore. 1963). Others have worked with retarded children whose 
mental ages were below those of the typical beginner (see the February 
1963 issue of the Review, pp. 126-36). In fact, Fowler (1962) was able 
to teach at least some rudimentary reading skills to a two-year-old, using 
very flexible training techniques. In answer to the apprehension that 
many people have concerning possible deleterious effects of early training: 
Durkin (1961, 1963) concluded from two longitudinal studies that pre 
school reading does not in itself lead to problems when the child enters 
school. It is significant that in all of these successful attempts at teaching 
very young children, some form of individualized instruction was used. 
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CHAPTER VII 


Mathematics and Science Programs for the 
Elementary School Years 


GLORIA F. LEIDERMAN 


In the past five years. coincident with the accelerated development a 
new curricula in mathematics and science. there has emerged a bale 
interest in learning and teaching of children’s mathematical and scienti sy 
concepts. The primary efforts of many of the curriculum groups in ien 
initial stages have been devoted to the preparation of materials and trair i 
ing of teachers. The point of view of major curriculum projects in iun 
tary mathematics and science will be presented in this chapter. with ee 
references to concept teaching and learning and a discussion of researc 
needed. 


Mathematics Programs 

The pressures for new mathematics curricula stem from mathematics 
research leading to advances in such fields as topology. abstract algebra. 
and theory of games as well as from the automation revolution (Price: 
1961). The contribution of mathematics to such diverse areas as seen 
technology, genetics, physics. and business production and distributior 
was cited by Brown (1961) as necessit | schoo 
mathematics. s 
Weaver and Gibb pointed out in the June 1961 issue of the REVIEW 
that although there has been an intense interest in “modern” mathematic 


: "inte o 
during the past few years. no definitive research in the development 
instructional media and programs h 


ating an improvement 1 


as been reported. 


School Mathematics Study Group 


The main purpose of the School Mathematics Study Group (SMSC). - 
described by Begle (1958), is to develop curricula and teaching mater 
for all students based on the best available knowledge of mathemati °% 
pedagogy, and the needs of our society, Since it is not possible to predio 
exactly what career any particular student will choose nor exactly petes 
mathematics skills will be important in the future. it is assumed Da 
teaching which emphasizes understanding, insight, and imagination wi! 
out neglecting basic skills is best for all students. ell 

The primary method of curriculum evaluation used by SMSG, as W° 
as by many other projects, has been to try out materials in classrooms a" 
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revise these on the basis of teachers’ reports and assessments. A testing 
program to measure mathematics achievement in fourth and fifth gratis 
cla using SMSG Mathematics for the Elementary School was reported 
by W eaver (1963). Mean gains from fall to spring testing on the SRA 
Achievement Series: Arithmetic showed that these groups equaled or ex- 
ceeded normally expected gains in terms of grade equivalents in both 
reasoning and computation scores. Tests were devised and administered to 
measure achievement in selected aspects of content emphasized in fourth 
and fifth grade SMSG texts. These test findings form the beginnings of 
normative data for SMSG materials. An Ideas and Preferences Inventory 
was used with this sample to measure attitudes toward arithmetic. Un- 
fortunately, in this study there were no control classes using more tradi- 
tional mathematics curricula for comparison of attitudes or achievement. 

A study using paired experimental and control fourth grade classes was 
reported by the Minnesota National Laboratory (1963). The experimental 
classes were taught from SMSG texts; the control classes were taught from 
conventional texts. The test measured achievement in traditional arithmetic 
from fall to spring. Although the differences favored the experimental 


classes, no statistical tests of significance of difference were reported, nor 
asure concept understanding, the 


were any tests given which might me 
major emphasis in SMSG texts. 

The National Longitudinal Study of Mathem 
1963) began studying 122.000 fourth, seventh. and tenth grade students 
in 1962. This is a five-year study designed to measure students in both 
conventional and new mathematics curricula on such variables as attitudes 
toward mathematics. reasoning. analytic skills. and mathematics achieve- 
ment. No findings have yet been published from this research. 


atical Abilities (Cahen, 


University of Illinois Arithmetic Project 


Project. under the direction of 
arily to stimulate children’s 
rsue mathematics beyond 
The philosophy of this 
oach to problem 
atical con- 


The University of Illinois Arithmetic 
David A. Page, has devised materials prim 
interest in mathematics and to allow them to pu 
the usual limits of elementary school arithmetic. 


group, according to Hohn (1961), is that a creative appr 
solving can be used at all ages to introduce significant mathem 


cepts and methods of thought. 
A description of the influence of 


was based on experiences of the Project s ‘derably hi 
Deans (1963) reported the þelief of this group that considerably higher 


proficiency in computation can be attained. coincident with the introduc- 
tion of new, genuine mathematical content. It is to be hoped that systematic 
studies will eventually be undertaken and published to support—or reject 


—this conclusion. 


Additional Reference: Page (196: 


this approach to teaching mathematics 


taff with experimental classes. 
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Madison Project 


Davis (1963) specified four major activities by ER Gm ee 
Project has contributed to the rational guidance of educational e un 
The first activity consisted of trying out ideas, seeing how they ^ or des d 
shaping plans which are in accord with the results of thrusts and prom e 
Included among examples cited were teaching the arithmetic o - d 
numbers to first and second graders and allowing children in grades y 
to work with algebraic identities and proofs of algebraic theorems. 
second activity has been to help administrators and teachers introduce 
new mathematics by providing them with books, films, and in-service 
courses. The third activity has been to devise creative mathematical expe- 
riences for children; the fourth, to work toward a theory of instruction. 
Since a primary aim of this project is to guide educational evolution, e 
would seem that this guidance should be based on some form of critica 
evaluation in addition to creative thought. In none of the publications 
of the Project which were reviewed, however, were there indications of 


what objective criteria were being used to evaluate the outcomes of any 
of the four categories of activities. 


Additional Reference: Davis (1963). 


Sets and Numbers Project 


More systematic study of new mathematics materials has been made by 
Suppes and his collaborators than by 


area. Suppes and Hill (1963) stated t 
Numbers Project is to provide a progra 
pedagogically simple, 


any other group working in this 
hat the intention of the Sets and 
m that is mathematically sound and 
stressing structure and foundations. The central con- 
cept in the materials is that of a set, because this concept is basic to the 
development of the idea of a number. Operations on sets are introduced 
as fundamental to the parallel operations on numbers. 

Suppes and McKnight (1961) analyzed errors made by four first grade 


classes on each chapter of the experimental workbook. On the basis of 
percent of errors made, certain topics were excluded from the qesiuihus 
materials to be tested the following year, and other topics were introduce 
earlier. 


Suppes and Hill (1963) reported a comparison of first and second grade 
classes using these materials with matched 

tional program. Results indicated that on the arithmetic portion of the 
Metropolitan. Achievement Tests, children using sets and numbers score 
as well as, or better than, children scored in the traditional program. 4 
tests were given to measure that content included in the experimenta 
materials which was not measured by this standardized test. 


bee à ized 
Several studies of concept formation in young children were summariZ 


by Suppes and Ginsberg (1962). These were well-controlled experiments 
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lodged within a theoretical model. Some of the major findings were these: 
bing learning was more efficient if the child. after making a mistake. ae 
et a m Tae es ir in E: presence of the stimulus to be 
(c) prior learning of p pen faepe a jobs ce a 

a g a related 


concept. 


Additional Reference: Suppes (196+). 
Greater Cleveland Mathematics Program 
leveland Mathematics Program has been to 


lementary and secondary curriculum that 
computational schemes (Deans, 


á The purpose of the Greater C 
evelop a logical, systematic e 


) ; E 
ia basic mathematical concepts before 
3). Extensive use has been made of the discovery approach to learning. 


The program has been experimental to the extent of trying out materials 
$ a large number of classrooms and revising them on the basis of feed- 
ack from the teachers. No published reports of specific evaluation pro- 


cedures were located. 


Medius of Concepts Utilized in 
ew Mathematics Programs 


red in the literature evaluating, through small-scale 
in various new mathematics cur- 


sess the effectiveness with 
n SMSG units when their 


ane reports appea te 
senis certain of the concepts utilized 
whi Ss Joyce and Joyce (1964) attempted to ass 

hich children could learn central ideas of certa! i 
ability to perform concrete and formal operations—in the Piaget develop- 
mental sense—was controlled. The secondary purpose of this study was to 
explore the feasibility of cooperation between school districts and univer- 
sity personnel in undertaking small research projects on the effects and 
appropriateness of the new curricula for young children. The major find- 


ings of this study were that although the fifth grade class scored signifi- 
s on a test of formal reasoning prior 


cantly low. a 

er than the sixth grade clas: r E 
to the SMSG, the two grades were not significantly different on ed eee 
test after the experimental units had been taught. Both classes = er 
nificant gains on a test designed to measure acquisition of the ce 


Concepts of the units. 
one (1963) reported the effect 
bis a number system with base five. 
i perimental group’s understanding 
ee he used two additional fourth grad 
* did not test these children on understan 
System at times equivalent to the pretests and 
mental classes. 
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The work of Dienes (1963) was concerned with the process ab et 
matics learning rather than with the learning of ees "iun epit: n 
concepts. His five experimental groups were composed of c x. rer el 
2-6. This work was based upon the theory that the learning : v- i AO 
concepts, as well as other cognitive structures. is achieved aes e Pon 
The book described the activities and complex games used in tez g 
certain mathematical ideas to experimental groups. 


Science Programs 


The task of bringing new knowledge to all students in w ays ampri 
to their interests, abilities, and future lives requires new "ie at 
was the point of view expressed in the introduction to Science — 
Improvement Projects, which described curriculum projects currently sp 
sored by the National Science Foundation (1961). In his recent review, 


A x . x : NE ne 
Atkin (1961) pointed out two major threads in curricular revision. In o 


: ffl : TT jnnova- 

approach, the content of a particular discipline is the source of inn A 

i a ; x eps luv serve: 

tion: in the other approach. a model of children's thinking ability se E 
ad Sie: ^ : ded n 

as the focus. In addition to reviewing studies on curricular developments; 


s : à A 2 E 5] sco? jien- 
Atkin reviewed studies of Instructional materials and of children’s sc 
tific concepts, 


Science Curriculum Improvement Study 

: , ed 
The philosophy of the Science Curriculum Improvement Study. as — 

by Karplus (1962). is that the function of elementary education is to gut 


the conceptual development of children by providing them with particu- 
larly informative and su 


ggestive experiences as a base for their abstrac- 
tions. At the same time, children must be provided with a conceptua 
framework that permits them to perceive the phenomena in a meaningfu 
way and to integrate their inferences into generalizations of lasting value. 
The testing of specific experimental units in the elementary grades was 
described, and evaluation of pupils learning was discussed. 1 
More recently, the study has been concerned with describing lessons anc 

ich combine discovery with expository introduction of cose 

by the teacher (Atkin and Karplus. 1962; Karplus, 1963; Thier. Powell. 
and Karplus, 1963). This project is of interest because of its attempt to 
develop a concept of science education based on the development of chil- 


ee ange : fee al 
dren’s thinking, utilizing discovery within a framework of conceptu 
presentation. 


University of California Elementary School Science Project 


The major activit 
Science Project h 
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scienc als " dicm 
cres i e ta en fon 
ee en sity of California. ? uch reliance has been placed 
experimentation by the children in de stratine FER areas 
oe i snails seks eei la scientific method 
ike EPUM of instruction during ‘the frst two years of 
subjective fe one de idis by Scott (1962). The data collected were 
Projeot em = = hers of pilot classes: subjective reports from 
D oy cate ota = who oliserned the instruction in the pilot classes; sub- 
irom E sito parents, school officials, and observers; and results 
ndo as " b ae ve m by the evaluator for appraising the 
NEN Rae i, : 5 T as children. Findings were presented both on individual 
ranis deoa MINE NEM These findings suggested a relationship 
t la : qn and the understanding of abstract concepts and the 
ips. Thie ie i ective teaching of the teacher s understanding of con- 
port, however, summarized the findings without presenting 


any systematic data or analyses. 


Illinois Elementary-School Science Project 


me. mew of the Illinois Elementary-School Science Project has been 
a ie aa coficepts of astronomy to elementary school children. The work 
vd oss ) phase of this program was reported by Atkin (1961) and 
Prat i by him in the June 1964 issue of the REVIEW. l 
Mages ems of curricular evaluation. within the context of the Project. 
delineated by Atkin (1963), and the kinds of evaluation information 
gathered by the Project were enumerated. Other investigations of these 
astronomy materials were reported by Pierce (1963). One study concerns 
effects on motivation when children are made aware of the conceptual 
framework of instructional materials as compared to motivation in a situa- 
tion in which the conceptual interconnections are less obvious and the 


child investigates for the sake of investigation. 


Commission on Science Edueation 


s of materials developed by the Commission on Science 
ion for the Advancement of Science 
ties which were expected to have a 
all scientific en- 


á The primary focu 
Education of the American Associat 


was the construction of a set of activi 
behaviors common to 


high probability of shaping those a 
deavors: the processes of science. Walbesser (1963) described the evalua- 
tive techniques built into the new curriculum. The strategy consists of a 
three-phase approach: (a) the specification of curricular objectives as be- 
haviors, (b) the measurement of each of these specified behaviors by 

ent of be- 


means of a check list of competencies. and (c) the measurem 
haviors that characterize the processes of science. 
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Combined Science and Mathematics Programs 


The primary aim of the Minnesota Mathematics and Science Teaching 
Project is to produce a coordinated science and mathematics curriculum 
for kindergarten through grade 9 (Rosenbloom. 1963). Research plans in- 
volve short-term evaluation of curricular materials for revision purposes. 
More rigorous evaluation will be undertaken when students graduate from 
the experimental preservice program. In addition. 


basic research in cogni- 
tive processes is to be undertaken, 


Conclusions: Evaluation and Research 
While the great intellectual and technologic 
have had their impact on curricular change, 
lent impact on educational research, 
ties, and materials abound in 
reports have failed to include suc 
tests or evaluative instruments u 
and the kinds of analyses from 


al advances of recent years 
they have not had an equiva- 
Although reports on objectives, activi- 
educational research publications, these 
h significant information as the names of 
sed, the tabulation of the data collected. 
a which conclusions were drawn. While 
systematic procedures may have been used for revising materials in many 
projects, the actual evaluations do not appear in the literature. However. 
this does not imply that evaluation in some form has been lacking. As 
Mayor (1962) pointed out, all of the projects engaged in the development 
of new course materials for mathematics at the elementary and junior high 
school level have provided for classroom tryouts, reports from teachers who 
have used the materials, and modification based on these reports. Mayor 


suggested that while this research has not been well controlled in most 
instances, it does represent a new and important kind of research in mathe- 
matics education. 


The problems of evaluation are currently receivine considerable atten- 
tion. The papers of Atkin (1963) D 
different positions on the 1 
clear statement defining 
ment activities was m 
the greatest servic 


on can play in course improve 
1963). According to Cronbach, 
Tm is to identify aspects of the 

3 did express the hope that evalua- 
y ing on particular courses and contribute 
to a general understanding of classroom learning. 

Research Problems in Mathematics Education (U.S. Department of 
Health, Education, and Welfare. Office of Education, 1960) delineated 
problem areas for future research. The major emphasis in these problem 
areas was the learning of mathematics and various factors related to it. 
such as perception. nonverbal awareness, and personality variables. 

In his review of research pertinent to science instruction, Smith (1963) 
suggested that the important questions for research are these: Which con- 
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cepts are mo: 
pee th st va i i 
E vieux div in the further intellectual. development 
dde. c pts contribute most to the cognitiv P flecti de 
Bec J 2 of science instruction? Eee SUR AU 
ause these p i e 
the influenve of program fe y relatively new. it is important to 
sites Ron he novelty o these curricula on motivati Ene 
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FOREWORD 


Research in problems of adult education has increased markedly, as has 
research about education in general. The developing maturity of the field 
suggests that topics such as learning, curriculum, administration, and 
philosophy be treated in the future in appropriate issues of the REVIEW 
rather than in a separate issue. Scholars in these fields will have to broaden 
their concerns and be certain that the field of adult education is included 


in their preview. The chapter organization of this issue reflects this 
recommendation. 


Wilson B. Thiede, Chairman. 
Committee on Adult Education 
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CHAPTER I 


The Adult Education Enterprise 


ROBERT C. CLARK and EDWARD B. SASSE 


The number of published research studies relating directly to the organiza- 
lion, management, staffing, and financing of institutions of adult education 
increased significantly during the past five years. A number of outstanding 
doctoral dissertations analyzed phases of adult education as developed. 
through the public schools, libraries, cooperative extension services, uni- 
versity extension services, and related services. Research personnel identi- 


fied with such disciplines as law, sociology. and economics chose to study 
bearing on the organization, 


problems in education that have a direct 

staffing, and financing of the adult program. Private research organizations 
and citizens committees were involved in an intensive assessment of specific 
adult education programs and have recommended significant changes in the 


enterprise. 

The purpose of this chapter is to summarize selected studies which repre- 
sent significant contributions to knowledge of the organization and admin- 
istration of the adult education enterprise. 

Johnstone (1963) prepared the first of a series of studies sponsored by 
the National Opinion Research Center about the nature of adult education 
in America. A probability sample of 13,293 households was drawn; infor- 


mation was obtained from 23,950 adults representing 11,957 of these 
vere developed from these figures. 


households. Estimates of national totals v e 
Activities classified as educational were those in which the main purpose 
was to learn or acquire some type of knowledge, information, or skill and 


Which was organized around some form of instruction. Johnstone estimated 
n some form of adult education 


that 25 million Americans had been active i u 
during the period from June 1, 1961, to May 31, 1962. Approximately 
{f study. Most of the 


one-third were engaged in self-organized programs © » 
activities were primarily noncredit activities, with strong emphasis on 


general education, vocational subjects; religion, hobbies, and recreation. 
More than one-half of the courses Were sponsored by churches eed 
gogues, colleges and universities, and community organizations. Govern- 
mental activity in adult education was concentrated in two niat are 
(a) vocations and (b) home and family life. Since the typical i ae 
was young and well educated and lived in an urban 2 ts uen iiia 
dicted that a tremendous adult education explosion will take place within 
the next decade, with emphasis on learning for work, learning for leisure, 
and li i 
E pr tributions of the research 
of selected disciplines to a actes 
169 


e con 


tudy of th 
ego d identified th 


dult education an 
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of the area. He combined an intensive review of the literature fiort Pph! 
subject matter areas with opinions obtained through pee epic 
selected persons of national stature in some discipline relate diera 
education or the applied field. Kreitlow identified the eight major oo m 
as adult education, business and industry, communications, psyc FE 
sociology, social work, vocational education, and education. He conc qe 
that the research needs in adult education fit into three p n 
application: (a) the adult as an individual and a learner, (b) the perc 
response to sociocultural phenomena, and (c) the adult —— n 
prise. The report included a classification system for research litera 
and a scheme of research needs in adult education. 


Public Schools 


Inroads into educational research by related disciplines are exemplified 
by the political scientists’ “discovery” of the politics of education. Masters, 
Salisbury, and Eliot (1964) designed their three-state study to look ; 
individuals, groups, and governmental agencies having a direct and tangible 
stake in the outcome of educational decisions, with the assumption that 
these would lead to the sources of power and would help to unmask the 
realities of political processes in this policy area. They concluded that re 
Illinois School Problem Commission was actually the arena in which 


decisions concerning the state’s role in education were made, that the 
Missouri State Teachers Association held a 
process of articulatin 
sional educators, 


thinking possible 


virtual monopoly over the 
g the policy demands of the public school profes- 
and that Michigan had no group to represent the best 
to solve the state’s educational problems.” The authors 
recognized the limitations of their study and presented their findings as 2 
posteriori for further research. It is hoped the study will bring the rela- 
tionship between politics and education to the attention of political scien- 
tists, professional educators, and others, 

Representative of the Syracuse Economics and Politics series was the 
work of Munger and Fenno (1962) , which dealt with the struggle to secure 
general federal aid for education. Research data were gathered through E 
review of the voluminous public record and from interviews with individ- 
uals involved in the controversy. 

The yearbook of the National Society for the Study of Education (1964) 
presented the developments that have occurred in educational administra" 
tion since the end of World War II. Reflecting interpretation as well e 
description, the new concepts were discussed under the following perceive 
changes: (a) administration is susceptible to empirical research; (b) ad- 
ministrators in all organizations are confronted with a common set ° 
tasks; and (c) administrators now view the human being in a different 
light. The text treated the background, present, and perspective bases oe 
administration in all organizations in representing the research and com 
bined thinking of the outstanding leaders in the field. 
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Libraries 


. Penland (1960, 1961) attempted to determine the image which public 
librarians have of the educational function of the public library and to 
decide whether attitudes held were strong enough to implement official 


educational objectives. Utilizing Guttman scaling techniques on 260 super- 
public libraries in Michigan, Penland found that 


he lack of commitment to adult education in 
e concluded that these librarians had not fully 
ducation, that they showed some re- 
dvances would be slow if left to the 


vising librarians from 33 
the librarian is the key to t 
American public libraries. H 
rationalized their position in adult e 
sistance to adult education, and that a 
typical administrator. 

Monroe (1963), in a study based on the analysis of historical docu- 
ments of three nationally known libraries, searched for the meaning which 
adult library education held for librarians in the United States during the 
period 1920-55. Monroe found a history of alternating confusion and reso- 
lution, but believed that the major causes for confusion have passed and 
that role consensus would now allow librarians to bend their energies to 


the fulfillment of the libraries’ responsibilities to adults. 

International Research Associates (1963a, b), commissioned by the 
American Library Association, examined the scope and extent of limited 
access to public libraries in the United States with particular reference to 
the problems of racial segregation in Southern libraries. Main focus was 
on the following aspects: (a) the extent and pattern of segregation, (b) 
the rate of change toward segregation, (c) factors tending to retard or 
promote change, (d) the role of the law in the segregation of public librar- 
ies, and (e) the attitude of librarians and library boards toward segrega- 
tion. The primary instrument of the study was a questionnaire mailed to all 
libraries in the United States. Supplementing this were interviews from a 
sample of 43 systems in 12 Southern states, a review of quise "X 
and an analysis of the location and resources of branch li piia in 
cities. The major conclusions were these: (a) direct discrimination was 


sag sq AT vas found throughout 
confined but indirect discrimination was 
to 16 states, discrimination as well as state 


the United States; (b) all forms of direct i 4 : 
statutes requiring AE were unconstitutional ; (c) ogo 
had proceeded faster in the library than in some other P " Ee a 
partly because of efforts of professional librarians; and ( A js cis 
library integration was affected by the low priority Pee 
leaders. Criticism of the study was focused on aspects O 
discrimination. . 

A study of the predominantly library-centered Great hae ee tm 
reported by Davis (1960) and by Davis and 7 E uds Le 
utilized a national sample of 1,909 individuals in 17 is m 
and was conducted through self-administered questionnai 


i iewer. The 1960 
Beet i :nion Research Center interviewer. 
oce. pip sea iom description of the participant and the 


report covered a comprehensive 
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effects of participation. Findings showed respondents to be pink ge enn 
socially active young adults of high status. The strongest clea E 
effect of participation was increased knowledge of the liberal arts edm 
humanities. The 1961 report concentrated on an exposition of factors t 
promote and factors that inhibit the retention of membership. Major ke 
clusions stressed that program retention was extremely complicated, but tha 
gratifications to members involved regularity of participation, social struc- 
ture of the group, and initially strong interest and preparation. " 
The complexities and trends of financial support of libraries were m 
cussed by Leland (1963), Drennan (1963), and Drennan, Kittle, an 
Winnick (1964). Leland concentrated on libraries in cities of medium size. 
His article showed the most important event affecting local revenues to be 
the increase of intergovernmental payments or transfers. He decried the 
lack of published statistics on sources and amounts of library revenue pid 
identified this area as paramount in future research. Drennan reviewe 
the costs of library service and concluded that, although expenditures for 
public library purposes have increased greatly in the past two decades, 
the per capita cost of such operations has not increased to the same 
degree. Continued urbanization leads one to believe that public libraries 
will command larger total outlays from local governments. Drennan, Kittle, 
and Winnick reviewed the entire range of federal programs in operation 
or being planned to aid public libraries in the attack on poverty. Individual 


programs were briefly outlined, and implications for improved library 
service were drawn, 


Cooperative Extension Services 


The cooperative extension services throughout the nation have undergone 
more study and appraisal of their organization, program, personnel, an 
operations in recent years than during any previous period. Most of the 
studies and investigations have been carried out by academicians in adult 


education associated, to some degree, with the Cooperative Extension Serv- 
ice and by private research agencies. 


A study of the Ohio Cooperative Exte 
Collings, and Wilhelmy (1964) 
independent research group w 
objective evaluation of the pre 
of the Service. They conclude 
variety of educational 


nsion Service was made by Fishel, 
through the Battelle Memorial Institute, an 
hich applied the scientific method to an 
sent and potential structure and functions 
d that (a) the Service provided a “wide 
; informational, and Specific services to adults an 
youth in both urban and rural areas" and that (b) “the most notable trend 
in clientele was the increased demand for a high level of specialized tech- 
nical information by primary agricultural producers.” The lack of a 
clear-cut definition of purpose and objectives was seen as an important 
underlying factor for many of the problems of the Service, Recommenda- 
tions for suggested changes in the organization or operations were con 
cerned with (a) appraising the purposes and objectives of the Service: 
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(b) creating a long-run plan of development, (c) maintaining the desired 
public image, (d) providing a more timely and specialized service, (e) 
initiating certain programs at the state level, (f) providing more specific 
and clear-cut policy guidelines to Service workers, (g) initiating experi- 
mental projects in different methods of communication and different struc- 
tural arrangements that might improve operations, and (h) establishing a 
Continuing Education Extension Division of the University. 

The Report of the Century Council by Texas A&M University (1962) 
was the culmination of approximately 14 months of study of the structure 
and programs of the University by a group of 100 outstanding citizens of 
Texas, The Council recommended that (a) consideration should be given 
to granting proper rank and status to qualified nonteaching personnel; (b) 
careful evaluation of both the need for and the costs of agricultural exten- 
sion education programs should be made by the University on a continuing 
basis; (c) closer working relationships should be developed between ex- 
tension education personnel in the agricultural sciences and research and 
instructional personnel; and (d) efforts should be made to modernize 
publications procedures, to write research reports in simpler language, and 
to develop the use of the mass media of communication for more wide- 


spread and effective dissemination of research findings to the public. 
as seen by various clientele groups, 


The image of cooperative extension, 
has been studied by Abrdlum (1963), Blalock (1963), Cavender (1964) , 
Greenwood (1963), Griffith (1961), and Kyd (1962). 

Even though all university extension programs have made, and are 
continuing to make, significant and lasting contributions to the | 
and development of the particular segment of society for which they are 


designed, clientele groups represented or reported on by these writers 
recommended a major re-examination of ray saurait ae 
services, professional competency of staff, and financia reso P sii 
Factors associated with the coordination of extension ark 
research and resident teaching functions in agriculture ih t (1961). 
nomics were the focus of research by Hyatt (1961) and Ric "usus 
These two researchers agreed generally that the major factors i cw us 
the coordination of research, teaching. and extension rac particu- 
following: (a) historical background of land gram Baer Pam hy of cur- 
larly colleges of agriculture and home economics; ( ) P ) x artmental 
rent administrative leadership toward coordination; PA veles to 
identification of subject matter specialists, es ei supervision, 
policies concerning employment, — he E determination of sala- 
budgeting, academic rank, joint appointments, nd evaluation of programs. 
ries; and (d) joint departmental staff planning à 


ion Services 


Two reports by Carey (19612, b) for the Caan r- Se eat 
Education for Adults were designed to bring ri SP Me 173 


University Extens 


PE 
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in the field of university adult education to the attention of i gg 
of adult programs, educational researchers, and others. ne : s 
qualitative case study approach emphasizing the gathering of data ad 
quasiclinical nature, one study (Carey, 1961a) dealt with the qe cim 
university evening colleges, the historical circumstances under whic| = 
university service developed, the dynamics of evening school peran ^ 
and the social relationships that permeate the performance of the tas : 
the evening school. Carey concluded that the evolution of university — 
sion services had required a coincidence of men with vision, adults i 
unfilled aspirations, and a university ready to meet the challenge. ‘ 
second study (Carey, 1961b), based on a questionnaire survey and inten 
sive interview in 18 institutions, was concerned with forms and forces in 
university adult education. This research pointed up the need for closer 
integration of the university’s community aim with its central purposes. 


Related Services 


Schwartz (1963) 
U.S. Army and Air 
Using data obtaine 
educational progre 
Economy measures 


traced the evolution of nonmilitary education in the 
Force from 1900 to 1960, with emphasis on 1948-60. 
d from official records and documents, he found that 
ss was of little consequence prior to World War I. 
in 1923 strongly de-emphasized education; this pattern 
continued until World War II. Extensive educational programs emerge 
and were made an adjunct of military life. Schwartz reported further that 
both services showed a continued trend for expanded programs based on 
the following assumptions: (a) morale would be improved; (b) boredom 
would be minimized, resulting in fewer discipline problems; (c) recruit- 
ment would be facilitated; (d) the increasingly complex military structure 
demanded better educated and trained personnel; and (e) a well-informe 
serviceman would be a better fighting man. . 

A study by Clark and Sloan (1964) was the third of a series concerning 


education outside conventional institutions of learning. Comprehensive in 
scope, it depicted the Armed Fo 


rces educational program as indicative o 
a universal trend toward educati 


lon in nonacademic channels. The authors 
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CHAPTER II 


Programming 


COOLIE VERNER and JINDRA KULICH 


design and management of instruc- 


This chapter is concerned with the 
A number of recent reviews have 


tional situations in adult education. 
been made, notably the important and exhaustive review of research on 
liberal adult education by Brunner and others (1959). Other summaries 
were prepared by Brim (1959) on parent education, by Dietrick (1960) 
on lecture and discussion, by Lionberger (1960) on the adoption of new 
farm practices, and by Rogers (1962) on all aspects of the diffusion of 


innovation. Annotated bibliographies were prepared by Schramm (1964) 


on programed instruction and teaching machines and by the National 
University Extension Association (1960) on correspondence education. 


Design of Learning Situations 


Research about program planning in adult education is sparse. Most of 
what has been done relates to the Cooperative Extension Service and is 
contained in graduate research reports. Stein (1960) analyzed correspond- 
ence course design and singled out four principles applicable to the 
method: (a) “the educational objectives must be appropriate to the 
medium;" (b) *the medium must be appropriate to the clientele;" (c) ‘the 
learning experiences prescribed must be appropriate both to the medium 
and the clientele;” and (d) “special care must be taken. to insure that the 
planned learning experiences offer sufficient. flexibility for an adult 
audience.” In a case study analyzing the experience of the University of 
Chicago’s Home-Study Department, Stein (1962) evol 


ved a formula for 
i isi i i urses. 
making curricular decisions in planning adult correspondence co 
eram design wer 


Psychological factors affecting program exe ro ered 
Lorge (1963) and Jensen (1963) . Lorge listed four — a nn 
tions for program planning derived from his study of cea differ- 
(a) within any group of adults, there are considerable n ec 
ences which are greater than in groups of children or yout j is en 
Situations should provide a sense of mastery and success oF nass and 
(c) adults should be provided with rewarding learning — 5 idi 
(d) every adult brings to any learning situation pond im cin 
are a great asset in teaching adults. Jensen (1963) ana is decis that 
of adult instruction and the types of sociopsychologica ae types of 
affect learning. He enumerated a set of principles Lupe venie ced 
interactions that develop in the instructional D we iM 
psychological conditions. (b) problem solving and !^ i 
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(c) decision-making interactions, (d) social-influence interactions, (e) 
social acceptance and personal evaluation, and (f) informal private 
interactions. . : 

There is little substantive knowledge about program planning to guide 
adult educators. This is an uncharted field in which research of any sort 
would be fruitful. 


Organization of Learning Situations 


The area of processes employed in educating adults has been a matter 
of confusion and conflict in terminology and constituent concepts that 
has seriously impeded the construction of a valid research design. In an 
effort to clarify this situation, with a view to providing a precise basis for 
research, Verner (1962) proposed a theory of adult education processes 
that drew sharp differentiations among the constituent processes that are 
precise and functional. He identified method as "the relationship estab- 
lished by the institution with a potential body of participants for the 
purpose of systematically diffusing knowledge." The method of education, 
therefore, identifies the way in which the potential learners are organized 
for purposes of instruction. Verner further identified three basic classifica- 
tions of method in terms of pattern of organization: (a) individual 
methods, “in which the relationship established by the institution with 
the potential learner is on a one-to-one basis," (b) group methods, in 
which a relationship is established "among the participants and between 
them and the agent so that the learning process is under continuous 
direction," and (c) community methods, in which the learning situation 


is constructed “within the context of the social system so that learning 
occurs in a setting of reality." 


Welch and Verner (1962) 
education group method (wo 
(bulletin) ; they used the 
measure of effectiveness, T! 
industry members in four 
significantly better than th 
superior to indirect influen 
menting the workshop with 
significant increase in ado 
education and adult self-ed 
for the former. 

A related study was conducted by James (1962) 
preference for method used. He asked 503 basic ai 
between learning by lecture and by reading. 


difference in scores on a posttest among those 
to learn from reading, 
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studied the relative effectiveness of an adult 
rkshop) and a mass communication method 
adoption of recommended practices as the 
he study was conducted among food service 
metropolitan areas, The group method was 
e bulletin in changing behavior; each was 
ces for the particular group studied. Supple- 
the bulletin did not produce any statistically 
ption. This study, in effect, compared adult 
ucation and yielded significantly better results 


to determine personal 
rmen their preference 
James found no significant 
who indicated they preferred 
those who chose lecture, and those who indicated 
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no preference. For the group as a whole, however, reading produced 
higher scores. 

The comparative effectiveness of adult residential and nonresidential 
education was studied by Lacognata (1961). Factors studied included 
the acquisition and use of knowledge about (a) attitudes relating to the 
insurance profession and to fellow students, (b) patterns of interaction, 
and (c) opinions of the program. On separate examinations for testing 
knowledge acquisition and knowledge application, residential students 
were superior. On the state examination combining the two dimensions, 
however, no significant difference was found between achievement of 
residential and nonresidential participants. Compared to the nonresidential 
adults, the residential participants showed significantly greater “profes- 
ng experience and did a relatively greater amount 


sionalism” after the learnin 
of studying out of class. Lacognata suggested that “the combined inter- 
and group influence were the 


related variables of isolation, continuity. 
n residential programs.” 


determinants making for superior performance i 
Although the relative effectiveness of lecture and discussion has been 


the subject of many studies at the college level, very few such studies have 
been concerned with adults out of college. In a comparison of the effective- 
ness of university lecture and lay-led group discussion offered at the 
same time to adults in the Los Angeles area and using the same content, 
Hill (1960) found lecture and discussion about equally effective. More 
specifically, he found no significant difference in effectiveness between the 
two approaches (a) in improvement in the ability to identify concepts of 


anthropology, (b) in attitude changes, and (c) in changes in several 
e differences between lecture 


areas of behavior. The study revealed large x 1 
and discussion participants with respect to friendship and community 
involvement; discussion had definitely more impact. The greater influence 
of discussion in this regard may have been due to the “community- 
centered” locations of the discussion groups and the higher emphasis 
placed on social expectations by these participants. . 
Bigger (1961) arce the sala meeting as a method of education by 
surveying the instructors. Of the 38 respondents a emma — 
mately 30 universities in North America, 16 were identified as exi 
primarily adult educators. A considerable majority were of the rt 
single presentation had some usefulness. Lecture-discussion was g 


out as the technique best adapted to this method. 


it of Learning Situations 


ult education, Verner (1962) identified 


two distinct elements in the learning situation: vn» i puc ee 
Techniques were defined as “the relationship oup " d wis n 
learning among a particular and precisely defined ol 7 hs € 
in a specific situation." He further suggested a typology for » 
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Managemer 


In his theory of processes for ad 
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tion of techniques on the basis of use: (a) to acquire information, (b) to 
acquire a skill, and (c) to apply knowledge. An alternative scheme that 
may develop after sufficient research would classify techniques func- 
tionally on a two-dimensional scale, with one side indicating degrees of 
learner participation required and the other the degree of abstraction in 
the learning task. The materials, equipment. or conditions involved in 
the learning situation to supplement instructional techniques were identi- 
fied as devices. In a later work, Verner and Booth (1964) suggested that 
devices perform four major functions: (a) illustrative, e.g., audiovisual 
aids; (b) extension, e.g., radio and television; (c) manipulative, e.g 
tools and equipment; and (d) environmental, e.g., seating arrangements 
of participants. 


Instructional Leadership 


In a preliminary report of a study concerned with teaching styles and 
learning, Solomon and Miller (1961) identified seven clusters of styles 
established through correlations of data gathered from interviews with 13 
instructors in various subjects. These clusters were then used in a larger 
study reported by Solomon, Bezdek, and Rosenberg (1963). In this study; 
24 instructors of a specific subject were studied, and measures of instructor 
behavior were obtained from the students, trained observers, and tape 
recordings. The data collected produced 169 items referring to the behavior 
of each of the teachers. A factor analysis of these items produced eight 
clusters of variables: (a) permissiveness versus control: (b) lethargy 
versus energy; (c) aggressiveness versus protectiveness; (a) obscurity- 
vagueness versus clarity-expressiveness; (e) OA of content- 
related (factual) student participation versus ANNOTA of 
participation; (f) dryness versus flamboyance; (g) encourssement o 
students’ expressive participation versus leonisine; and (h) warmth 
versus coldness, The findings of the study seemed io-dnüicate that. clarity- 
expressiveness and lecturing were most efficient in presenting an 
transmitting factual material, Comprehension, on the uir irand, was 
related to a “moderate position on the Permissivenesscontrol continuum” 
and to energy, aggressiveness, and flamboyance. Students evaluated most 
favorably those teachers receiving high scores on warmth and clarity- 
Class size was involved in a larger number of significant interactions 
than were any of the other variables. Students in large classes learne 
factual material best with teachers who emphasized student growth an 
who were permissive, warm, and flamboyant. Students in small classes 
did best with teachers who lectured, were relatively “dry,” and emphasized 
student factual participation, " 

In a study based on a representative sample of informal adult education 
programs using voluntary leaders, Liveright (1959) analyzed the adult 
learning situation and the variations of leadership styles he observec- 
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He found four factor: i zhi 

learning situation: (a). kind sl mis paupe a p s 
n of the leader, and (d) A of sed td wd d ks 
ee Aud | s nd materials. 
uris Anas ade ipee: ede present and other md 
ie g g rogram. Liveright o served that since lead i 
a eus of the learning situation, the leader had Wee 
ise € to the kind of program and kind of group. 
Me am 4 eadership style seemed to be related to a higher rate of 
panapa he roy Fuss his analysis of the dynamics of the learning 
eR | "eos arrived at five general conclusions: (a) an attitudinal 
dw Ae a ps seine: group calls for group-oriented leadership styles; 
pec Ke ess pompes in a group with little or no cohesion calls for 
E n = ead ership styles: (c) a primarily attitudinal program 
= $ gs p —— little or no cohesion probably requires the leader 
Slam b ually from content-oriented to group-oriented styles; (d) a 
Era as in a highly cohesive group requires the leader to use content- 
EM styles as much as possible while at the same time to give the 
gr p some group-oriented characteristics; and (e) in an understanding 
program in a highly cohesive group. the leader must satisfy the group: 
oriented expectations and at the same time lean “in the direction of 


content-oriented styles.” 


Specific Techniques 


Research studies related to instructional techniques for adult education 
a . . . 
re few, often poorly conceived, and generally inconclusive. For the most 
part, such studies as have been done are unrelated, and no logical synthesis 


is possible. 

Dunnette, Campbell, and J 
effectiveness of group and indiv 
research scientists and advertising personn 
with problems. Each subject brainstormed b 
man team. The findings indicated that indivi 
than groups produced, without sacrificing quality. 
individual brainstorming W hen preceded by 
in a group. 

Jansen and Stolurow (1962) conduct 
playing with female aides in a residen 
retarded. They reported the pattern of group 
the over-all effects of conditions used as (a) 
(b) imitating many roles, ) improvising aid 
improvising many roles. Imi own role, 
in producing larger number of iges,” was foun 
by the aides. 

Palmer and Verner 
discussion, and lecture-discussion 


aastad (1963) experimented with the relative 
idual brainstorming. Two groups of 48 
el respectively were presented 
oth individually and in a four- 
duals produced more ideas 
The superiority of 


as increased W brainstorming 


ed an experimental study of role 
tial institution for the mentally 
means that best represented 

imitating aide’s own role, 
es own role, and (d) 
although “most effective 
d to be least liked 


(c 
tating one's 
greater char 
e effectiveness of lecture, 
thin the class method at 
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(1959) compared th 
techniques wi 
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' involved 130 
Air Force Primary Pilot Training School. E ue . ae 
talleni M. "iati hysiology. On a test « i 
in six classes in aviation phy og obi o aea 
a Er a technique produced a higher oe E iol we 
did te de di sion technique. In assessing s 
i iscussion or lecture-discuss die E 
"i mm Palmer and Verner found that students ru ie E ss 
satisfactions, ; E | 
i anted a lecture fo : i 
-di ion method, (b) wan lec soins 
eid (c) wanted to participate but prm aos usi 
i on " : i 
si ot (d) “preferred a change of pace to break m iari 
of | vd r discussion only," and (e) "frankly admitted tha 
ecture or discus ; i ited 
rdinigue required too much extra work in preparation, 


Illustrative Devices 


Duvall and others (1960) 


eporting on a simi y shnical 
D rti similar study of tec 
training which used films or 


i “di ted 
slides combined with tape Se rem 

4 vt p 
that either device can be substituted for face-to-face instruction. 


in the study did not raise any objections to this means of prp 
Kreitlow and Edwards (1962) compared the effectiveness o “They 
bulletin, film, and television in reporting research to adult Leng sh | 
found lecture to be the most effective process. The bulletin was t 
to be more efficient than film or television, but not significantly be ih 
For all media used, the number of correct responses was related to 
ic grade leve] attainment of the participant. 


Environmental Devices 


A pilot study by Kennedy and Klein ( 
Seating behavior of large Soups as it rela 
cation or learning. The study 
(experimental 


ine 
1960) was designed to eae 
ted to effectiveness of esci 
Was conducted with two large gro 


i ini 8 ories. 
€ National Training Laborat 


j $ at 
* more consistent in their se 
ing to meeting; (b) those 


a T ore 
who arrived early were m 

3 (e) early arri 

e arrivers; and (d) 


en contro] 
n to large 


ings 
Vers tended to puce ee 
the data did not indicate à 


edie os ngth, 
group Participation, ego streng 
meetings, 


movement score, and reactio; 


Conclusion 


The development of s 


ystem 
and instruction 


alic researc 


i esign 
h relating to program desig 
al processes for adult educ. 


A the 
ation has been impeded by nity 
lack of a clear theoretical framework and by the absence of er 
"uk. c: i " 
in the identification and designation of the processes studied. As a res 
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studies reported in the literature showed inadequate controls in the 
research design. Most researchers have been concerned with determining 
the comparative effectiveness of one process over another, but too often 
they studied processes on different levels. Furthermore. the absence of 
precise definition and description of the processes examined makes the 
synthesis of research virtually impossible. Dietrick (1960) found so 
many different notions about what constituted discussion that the effective- 
ness of discussion as an instructional technique could not be ascertained 
in spite of the prevalence of studies purporting to do so. The theory of 
processes proposed by Verner (1962) suggested ways of establishing 
system and precision in this area of research that may lead to more 
significant research in the future. 
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CHAPTER III 


Evaluation of Outcomes 


WILSON B. THIEDE and JOHN MEGGERS 


Evaluation has come late to the field of adult education. For many years. 
Its justification was deemed to be its continued existence. Because most 
programs for adults are of relatively short duration and the curriculum is 
almost never sequential, evaluation attempts have relied heavily on par- 
Ucipant assessment rather than on measured behavioral change. In addition, 
much of the technical competence necessary for evaluation was housed in 
schools of education which, except for a few, were oriented almost ex- 
clusively to elementary and secondary education. Exceptions to this ab- 
Sence of competence and interest have been the supporting staff of the 
Cooperative Extension Service and a number of sociologists, the latter 
eng attracted to adult education in part because of objectives related to 
Social rather than individual change. During the past few years, the number 
of well-designed evaluations has increased. Because of this increase, Thiede 
and Draper (1963, 1964) included a category of evaluation in the annual 
bibliography of studies in adult education. 


Assessment by Participants and Leaders 


Kaplan (1960) completed a carefully designed study of 150 participants 
and leaders in liberal arts discussion programs in the Los Angeles area. 
His purpose was to determine the participants’ assessment of whether par- 
ticipation developed intellectual growth, civic participation, or continued 
study, More than 25 percent felt that they had gained “a great deal” more 
knowledge, and almost two-thirds reported they gained “some” more knowl- 
edge. In programs with the best textual materials, larger percentages 
reported gaining *a great deal.” More than two-fifths reported that they 
either read more than previously or read more serious works. More than 
one-half of the participants reported that they had changed some of their 
views. However, many members felt that little change had occurred. The 
discussion programs stimulated 48 percent of the participants to continue 
their educational activities in discussion groups or formal courses. Little 
evidence was found for greater participation in community activities or 
public affairs following participation in the programs. Participants who 
were not college graduates were more critical of leaders and more conscious 
of the absence of subject matter specialists than were those who were 
graduates. The investigators concluded that “the Liberal Arts Discussion 
Program for Adults performs a valuable educational function.” 
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From the results of a questionnaire study, Edwards (1962) reported that 
in the Knoxville High School Adult Program, 92 percent of those surveyed 
were working for a high school diploma; they liked the freedom of choice 
of courses; they felt the school library needed more space and materials; 
they had divided opinions on preferred methods and teacher qualities; and 
a majority indicated that they would not attend day high schools to com- 
plete their education. Similarly, Manley (1961) studied students and 
teachers of an extended day program of a state college in California. He 
found no significant difference in academic training between day and ex- 
tended day instructors, but a significant difference in distribution by rank, 
with fewer senior-ranked faculty members in the extended day program. 
Grading standards were not significantly different. The faculty felt that 
there was better academic achievement in the da 


the students concurred in this judgment, although 12 percent felt the 
extended day program was more ri 


students had better library access 


Measures of Knowledge and Information 
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Comparative Studies 


In the third of the major studies of discussion groups, Hill (1960) com- 
pared their relative effectiveness with the lecture method. He studied 12 
discussion groups of at least 20 members and one large lecture of more 
than 100 members. His evaluation procedures included a pretest and a post- 
test, interviews early and at the end, and observations of all sessions. The 
general subject matter was the same for all participants and was treated in 
11 two-hour sessions, each of which included a 30-minute dramatic record- 
ing. The same reading material was used by all members. The members 
evaluated the program favorably, but retained their original preference 
for lecture or discussion. There was no superiority of either method in 
ability to identify anthropological concepts. Neither method had much 
effect on the mean average attitude related to ethnocentrism, tolerance of 
ambiguity, democracy, or adult education. Kreitlow (1962), in one of a 
number of studies comparing methods of disseminating research informa- 
tion, found the lecture more effective than the film, bulletin, or television; 
the bulletin was the next most effective method. However, adults retained 
significantly more information presented by bulletin than that presented 
by the other methods. In a well-designed study using a knowledge test and 
attitude scales, Marsh and Hyman (1964) compared the effectiveness of 
programed self-instruction and a workshop. The subject was the use of 
radio in extension programs, and the participants were agents of the Co- 
operative Extension Service in North Carolina. The two methods were 
equally effective in increasing knowledge, in developing more positive at- 
titudes toward the use of radio, and in changing the agent’s concept of 
his ability to do radio teaching. 


Adoption of Practices 


One of the characteristics of adult education is the desire by the adult 
to have learning immediately affect life outside the program. A criterion 
that has been widely used in the Cooperative Extension Service is adoption 
by the participant of the practice or behavior desired. Hardee (1963) 
studied the effectiveness of an extension program in developing the re- 
sources of part-time farm families. He worked with 185 families, which 
were divided into two groups for analysis: (a) high integration and (b) 
low integration with the extension program. The program consisted of 
farm and home visits, tours, demonstrations, 4-H Club work, and use of 
mass media. Hardee reported that (a) the average value of sales of farm 
products increased substantially; (b) the median off-farm income increased 
(families with high integration had the lower increase); (c) family level 
of living increased more with the high integration group; and (d) the 
high integration group made more changes in farm and home practices. 
Marsh (1962) studied the effectiveness of extension programs in increas- 
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ing income and raising family living level over a fiv 
selected 50 families from each of eight counties. The e 
increased acreage from 108 to 136 acres, compared to i s T 
to 135 acres for a control group. The experimental group increased t E 
size of the livestock enterprise substantially, compared to the control group: 
swine increased 105 percent in the experimental group and only 57 percent 
in the control group. Value of farm products sold increased to a greater 
extent in the experimental group than in the control group. The experi- 
mental group made the greater increase in amount of food preserved. 
Porter and Wilson (1962), using a random sample drawn from a com- 
munity telephone directory, studied the impact of a consumer marketing 
program. The program consisted of a television appearance, a weekly 
newspaper article, four one-minute tapes for radio, and a two-page bulletin 
mailed to 600 persons, Consumer knowledge and behavior were both af- 
fected by the special marketing program. Those who recalled being exposed 
to one or more of the program items knew more about pork and its uses 


to have bought and served pork during the week 
following the program. Welch (1961) used i 
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adult curriculum. The two groups agreed on a number of criteria, such as 
objectives, achievement, requirements for diplomas and certificates, reasons 
for dropout and for re-enrollment, curricular flexibility, and influence of 
the curriculum on the community. 

Miller and McGuire (1961) began the definition of major objectives in 
four liberal adult education areas: political and social, community partici- 
pation, moral and ethical, and arts. Working with a number of adult educa- 
tors they proceeded from precise definition of objectives to the develop- 
ment of instruments. In each of the four areas they specified content 
objectives, behavioral objectives, and models for measuring the attainment 
of the objective. 
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CHAPTER IV 


Psychology and Learning 


HOWARD Y. McCLUSKY 


In spite of relative neglect, the psychology of adults continues to gain 
ground as a subject of systematic inquiry. The 1961 White House Confer- 
ence on Aging and the National Institute of Child Health and Human 
Development are illustrative of the growing investment of the federal 
government in this domain. Revised editions of textbooks in developmental 
psychology now give more attention than formerly to the full life span 
(e.g, Goodenough and Tyler, 1959; Vincent and Martin, 1961). Earlier 
longitudinal studies of children are now extending into the adult years 
(e.g., Kagan and Moss, 1962), and practically all gerontological research 
uses subjects in early or middle adulthood as a basis of comparison with 
performance in later maturity. 

Further evidence for the continued growth of the field may be found in 
reviews by Birren, Jerome, and Chown in the December 1961 issue of the 
Review, Birren (1960), Chown and Heron (1965), Kuhlen (1963b), and 
Williams, Tibbitts, and Donahue (1963) and in the publications of Brun- 
ner and others (1959), Knowles (1960), Shock (1963), and Tibbitts and 
Donahue (1962). A new textbook by Birren (1964), The Psychology of 
Aging, is by all counts the most comprehensive and thorough treatment 
now available in the field by a single author. 


Change in Psychological Abilities with Age 


Longitudinal studies provide the most convincing data related to change 
in abilities with age. These may be divided roughly into two categories. 
one involving a long interval and the other a short interval of years 
between testing. As an example of the long interval group, Bradway and 
Thompson (1962) administered the Stanford-Binet and the Wechsler Adult 
Intelligence Scale (WAIS) to 111 subjects at about age 30 who had for- 
merly taken the Stanford-Binet at the preschool and adolescent levels. On 
the average, mental growth increased beyond 16 years by about 11 IQ 
points, more in men than in women. The greatest increases occurred in 
tests of abstract reasoning and vocabulary. The gifted children studied 
originally by Terman are now in the middle years of adulthood and are 
still gifted, with above-average ability to perform on tests of conceptual 
thinking (Terman and Oden, 1959). 

Perhaps the most convincing data of the period are those contained in a 
follow-up study of adults who were tested for the first time at about age 20 
with the Army Alpha Intelligence Test (Owens and Charles, 1963). Re- 
tested for the second time at about age 61, the subjects maintained their 
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earlier level of performance, with a decline only in tests of numerical 
ability. In the first follow-up, when his subjects were age 50, Owens (1959) 
found that “increments and decrements in test scores from age 20 to age 50 
were roughly comparable at all levels of initial ability.” 

Short interval studies have more often used subjects aged 50 and over 
and, as in the case of the long interval research, have shown little decline 
in performance. In an investigation of 165 persons over 50 years of age. 
little over-all decrease in the full-scale WAIS was revealed after a three- 
year interval (Eisdorfer, 1963). Tested first on the WAIS and retested 10 
years later on the same test, 28 men maintained ‘ 
IQ's (Duncan and Barrett, 1961). 

The results of cross-section 
results of longitudinal research 
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al content show decrease with age, while 
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change increases with age. In a recent and thorough review of the litera- 
ture, Chown (1959) pointed out somewhat paradoxically that “rigidity is 
a very flexible concept." In a later investigation. generally regarded to be 
one of the most penetrating analyses of the problem. Chown (1961) 
demonstrated that rigidity should not be regarded as a generalized trait 
but as an attribute of behavior which varies with the nature of the task 
and that it is more significantly associated with low intelligence than with 
age. In one study. Botwinick (1962) discovered that in a structured situa- 
tion providing a "set to shift." older subjects performed with minimum 
difficulty ; in another, Botwinick and Brinley (1962) concluded that older 
people give optimum performance if a task is clear and time is allowed for 
monitoring and correcting potential errors. Self-pacing may contribute to 
this process: at least it improves performance. At the same time. in certain 
circumstances, some resistance to change may be an advantage: for ex- 
ample, Warren (1961) found that older subjects were less inclined than 
younger subjects to distort words in response to a tape recording. 


Learning and the Learner 


Even though adult educators may regard rote learning as an unrealistic. 


if not unfair, measure of adult learning ability. it is interesting to note 
the conditions under which it improves. A longer period of exposure was 
found to lead to improved verbal performance on the part of persons over 
60, even after it brought about no gain for persons between 20 and 50 
(Chown and Heron, 1965). Canestrari (1963) reported that self-pacing 
led to faster learning in older. but not in younger. subjects. In both cases. 
Omission errors decreased. Parenthetically. it is provocative that rote 
learning in the very old may offer a fair prognosis of the number of years 
à person has yet to live (Sanderson and Inglis. 1961). 

After interviews with 22 highly involved adults, Houle (1961) classified 
his subjects into (a) goal-oriented. (b) activity-oriented. and (c) learning- 
oriented learners. Taking off from Houle’s typology and equipped with 
responses to questionnaires administered to persons attending various adult 
education conferences, Sheffield (1964) divided Houle's goal-oriented 


category into personal-goal and societal-goal orientations. and his activity- 
orientated category into desire-for-sociability and need-fulfillment orienta- 
ion correlated. positively with 


lions. Sheffield also found that participat ; 
learning and personal-goal orientations. Ingham (1964) found four leisure 
types, while Brown (1964) discovered that alumni of highly rated colleges 
were more likely to pursue educational activities after college than were 
alumni of colleges with average ratings- 
In one of the most convincing studies 
chapter, Wolfe (1963), using a test of adult adjustment. 
and Attitudes, dichotomized a large number of persons over 65 into two 
groups, one well adjusted and the other poorly adjusted. In practically 
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every type of adult education activity examined, he found that bsp 
adults had a much more impressive history of participation than id Uh 
poorly adjusted adults. This was true for those having had only an c ! 
grade education as well as for those who continued in school long afte 
graduation from high school. 


Attitudes and Emotions 


Large numbers of people in our soci 
the last third of life, according to Axelrod and Eisdorfer (1961) and 
Eisdorfer and Altrocchi (1961). Even Some gerontologists share this 
attitude (Arnhoff and Lorge, 1960). However, Tuckman, Lorge, and 
Zeman (1961) found no loss in the intactness of self-drawings with age. 
Other studies show self-depreciation associated with age (e.g., Bloom. 
1961); some show ambivalence in self-attitudes (e.g., Werner and others, 
1961); and others disclose a disposition on the part of many adults to 
consider themselves younger than their chronological years (e.g. Zola, 
1961). Wallach and Kogan (1961) found older men having less self- 


ety hold a negative attitude toward 


oth sexes psycholo 
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while the younger adults felt that they had time for replacement. This dif- 
ference in the reaction of the two age groups points out the extreme 
importance of the time dimension in assessing the outlook of adults at 
various stages in the life span. 


Change in Personality with Age 


Although preceding sections dealt with selected dimensions of person- 
ality, a number of studies embodied a more global approach to personality 
change. In one study of hospitalized men aged 22-76, Edwards and Wine 
(1963) reported that 11 of 41 personality variables correlated positively 
with age. However, these correlations approached zero when the groups 
were equated for intelligence. Reichard, Livson, and Petersen (1962), 
using psychoanalytic concepts, conducted extensive interviews with 42 
retired and 45 soon-to-be-retired men. The results indicated that the pre- 
retirement period was characterized by instability and a sense of insecurity, 
while the postretirement period was noted for a reduction in defensiveness 
and the development of a favorable view of the earlier stages of life. A 
factorial analysis of the data disclosed three groups of successful adjusters 
and two groups of unsuccessful adjusters. In the former category were (a) 
the mature group, described as having a constructive approach to life, 
(b) the rocking-chair group, composed of highly dependent persons, and 
(c) the armored group, possessing well-developed defenses against anxiety. 
In the unsuccessful category were (a) the angry group, hostile and often 
blaming others for their shortcomings, and (b) a small cluster of self- 
haters. -— " 

By all counts the most systematic and significant effort in zu pun is 
the project known as the Kansas City Study of Adult Life con xg a a 
research team from the University of Chicago (Nengezten, F Ao i 
1964). In general, the studies in this project may um seeded doin 
categories: (a) research dealing with the tocar cin and qum A 
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Neugarten, 1964). In the intrapsychic group, sex differences increased wi 
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age with older men, who were more receptive than younger m pale 
“affiliative, nurturant and sensual promptings," and with older n 2 
Who were more acceptive than younger women of their M ca educ a 
egoistic impulses." As indicated above, however, in the socioad pore 
category; such variables of social adjustment as integrity (Gruen, ioe 
adjustment (Peck and Berkowitz, 1964), and cognitive rope Te 
garten, Crotty, and Tobin, 1964) were not related to age, and ot m $e 
aged and older persons were found distributed throughout the integrated, 
defended, and unintegrated types (Neugarten, Crotty, and Tobin, 1964). 


Theoretical Approaches to Change with Age 
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disengagement accounts for all of the phenomena associated with the 
passage of time from 40 years to the end of life. 


Conclusion 


cent psychological production he is 


When one scans the: terrain of re 
ssed with data show- 


impressed with a number of items. First, one is impre 
ing the great range of individual differences that increase with age. So 
extreme are the differences that one must be very cautious about the kind 
of claim he makes for the validity of his generalizations. Second, and a 
consequence of the first point, data from various sources are providing a 
growing case for a differential psychology of adults. Already it is clear 
that the pattern of abilities increases in difference from adolescence through 
early adulthood and on into the middle and late years. Moreover, we cannot 
assume equivalence of stimulation and motivation in these successive stages 
of change. Third, more research is greatly needed, but it must be conducted 
with concepts and instruments that are most relevant to the unique features 
of the adult condition. For example, more work needs to be done on an 
appropriate criterion of adult intelligence, on "age fair" tests, and on 
devices that get beneath the surface of the adult personality. Fourth, there 
is a serious lack of research dealing with the lower class adult. This is an 
important area of inquiry in its own T but will become increasingly 
important as the implementation of recent civil rights and economic oppor- 
tunity legislation gains momentum. Fifth, the problem of adult pere 
is in part a problem of the conservation of human resources. Already the 
limits of the productive years are moving upward in both theory and 
practice. As indicated earlier in this chapter. overcoming the discrepancy 
between curves of problem-solving ability and general intelligence would 
represent a great gain in adult performance. Finally, we are "rm Leeds 
confronted not only with discovering talent and creativity for the first t 
but with giving both an opportunity to develop through as long a period o 


the life span as possible (Botwinick, 1964). 


ight, 
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CHAPTER V 


Socialization 


GALE JENSEN and ALBERT SCHRADER 


ate rp :hological processes by which the human being becomes 
P Gr lave long been of interest to behavioral scientists and educa- 
" pei rii aor Most of the theoretical and research work pertaining 
2 c: si nt p gnome, however. has been devoted. to socialization 
different n E particularly the child rearing and training practices of 
] communities. social classes, and cultures. In recent years, de- 
velopment of industrial societies has begun to force researchers to focus 


some ¢ : P d 
me attention upon the socialization of adults. 
ular stress the significance of adult so- 


adults whose social 


. Three developments in partic 
Vra in these kinds of societies. First. many 
Sedi as children took place in rural communities and small towns 
lave moved to metropolitan and suburban communities. Much of the 
social learning in which their adult social lives are grounded is inadequate 
and obsolete. Second, technological and industrial. developments have 
greatly increased the rate of reorganization of social environments. Since 
radical kinds and amounts of social reorganization take place within the 


Spa H fos * t H . ja i —" 
pan of an individual's lifetime, adults must acquire new social learnings 
with the problems of their communities 


and societies, Third, the extraordinary development of modern formal 
Organizations has created a situation in which most adults invest the 
major part of their time and energy in the ventures of such an organiza- 
lion. Since an individual's personal and family welfare depends upon 

ganiza- 


how well he is able to function within these conditions, formal org 
dual to acquire the necessary 


lions generate powerful forces on the indivi i 
social learnings. Consequently, adults may undergo a kind of socializa- 
tion that is beneficial to the formal organiz function 
but not to their families and communities. 

Theoretical and research studies that are identified as adult socialization 


studies are seldom found in the behavioral science periodicals. However, 
ide data about the kind of be- 


there are a number of studies that prov! d 
havioral changes adult subjects undergo in different kinds of social situa- 
tions; the theoretical and research literature reviewed in this chapter is 
based on studies of this kind. Since the adult spends a great share of his 
energy and time within formal organizations. special effort was given 


to locating studies that provide data about the kinds of social learnings 


that result from such participation. 
ing the literature first required the 


The procedure employed in review e 
perusal of diverse journals in order to locate articles that provided theory 


201 


if they are to cope productively 


ations in which they 
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and research data about the social learning or d pipes 
changes manifested by adults in social situations. The d en 
to organize the articles and books so identified into a P ia den rd 
raphy from which studies of particular interest con er à le Se e 
more intensive examination. A conceptual framework, — hice 
the different studies in a common way. provided a means to ek ear 
social situations or environments to identify the structural. s 

interaction, and cultural influences on the 
The object of this analysis was to 
conditions. An attempt w 


adults involved in the studies. 
identify the operating psychological 
as then made to determine the kinds of cognitive, 
affective, and motor responses the subjects made when trying to woe 
pate effectively under these conditions, Finally, the reviewers attempl g 
to identify the kinds of social learning that emerged from this experien , 
in terms of perceptions, affective attitudes, motivations, social skills, an 
social values for governing behavior in similar situations. 


Motivations 


In a longitudinal study involving 6.51.1 h 


Astin (1963) discovered that the motivation to earn a Ph.D. degree 
increased in those students who attended coeducational liberal Hits 
colleges, but decreased in those students who attended northeastern men's 
colleges. This Suggests that the organizational cultures of the different 
colleges might produce changes in Motivations for further academic 
degree work, McDill and Coleman (1963), in a study of the influence 
of high school Social status upon acquiring motivations (a) to attend 


college and (b) to attain high academic achievement, found that students 
of high social status had acqui 


ae H B ing 
red positive orientations for attending 
college and negative orj i 


: s F These 
i ud for high academic achievement. The: 
findings intimate that the prestige structure of 


Motivations for and levels of pe 


964. i j : 5 n : : t 
Strauss (1964), using purchasing agents as subjects, identified a se 
of Sociopsychological conditions which seemed to lead to learning mo- 
tivations for professionalization, It would appear that whenever persons 
trying to increase their prestige and Power come into conflict with the 


different working groups of an organization, they turn first to upper 
management for Support; if they fail to obtain 
are attracted to profession 


igh-aptitude college students, 


ee i s 
an organization produce 
rsonal achievement. 


n 
3 such support, they pet 
alization as 4 means of improving their self- 
images. 
D — r s : - 
Guest (1962) reported on à case study in Which changes in the au 
thority and communications s 


; large industrial plant brought 
about changes in the inates to initiate interactions 
with their superiors, young adults by Gruber and Weitman 
(1963) gave evidence that motivation for accepling and directing intel- 
lectual development was dev 


eloped by ch 


anging (reducing, in this case) 
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the amount of class time during which instruction was directly under the 
control of teachers. 


Perceptions 


Utilizing both professional and subprofessional staffs in hospitals, 
Haas (1964) discovered that the degree of perceptual consensus about 
the nature and function of their work roles was inversely related to the 
amount of friction that existed between them. These findings strongly 
suggest that some kind of socialization is at work which produces a lack 
of perceptual agreement about the functions of different roles, which in 
turn produces friction between the roles to a degree commensurate with 
the lack of perceptual agreement. Jensen (1962) found that when the 
perceptions learned about the authority relations between roles were in 
high disagreement, the decisions made about hospital operation were 
disruptive to a large number of role occupants and produced frustration, 
a reduction in job performance, and sometimes a drop in work motivation. 
These consequences seemed to reinforce the learning of misperceptions 
about authority relations, thereby creating a vicious circle. It would seem 
that when hospital personnel acquire incompatible perceptions about 
authority relationships between roles, they behave in a way that violates 
the criteria for therapeutic conditions for mental patients. Pearlin (1962) 
studied certain sociopsychological conditions that produced alienation 
behavior in nurses with respect to their work. In particular, he found 
that alienation from work was produced whenever the authority structure 
of the organization was such that (a) great positional disparity existed 
between superordinates and subordinates; (b) authority was r 
in a peremptory fashion; (c) persons in authority were pe y inac- 
cessible; and (d) extrinsic rewards were unsatisfactory, and little oppor- 
tunity for attaining a sense of achievement through work us de 

Parker (1963) sought to identify the sociopsychological o en PE 
of organizations that had some effect upon the social gru gà “7 
would acquire by participating in them. He identified three bp 
characteristics: performance motivations, leadership sie ie E 
autonomy. One interesting. finding with respect to per pipe eo 
tivation was that the sociopsychological basis on Noes ote aps seed 
rested was the perceptual learning. termed “job performance 


mental or related to job security.” In m RU the base rn 
ivati i i reanizati the perce g 
iy the organization produced the nir 

Mec qus en " l to job security,” the productivity 


that “job performance is instrumenta 
of the organization would fall. 


Attitudes 


Porter (1962; 1963a, b, c, d) examined a 
cs of organizations with respect 
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In a series of studies. 62; 
number of sociopsychological characteristi 
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to the kinds of affective attitudes that develop in adults from this m 
tion experience. In a study of the formation of job satisfaction pmo 
by managers, Porter (1962) suggested that managers develop posti : 
affective attitudes toward their jobs when they perceive themselves at : 
vertical level within the management structure that is commensurate W i 
their needs for self-actualization, autonomy, and esteem. Stated proposi- 
tionally, the findings seem to say that managers have strong needs for 
self-actualization, autonomy, and esteem; when they are able to attain 
managerial positions that can fulfill these needs, they will develop favorable 
affective attitudes toward their jobs. Porter (1963a) showed that higher- 
level managers placed relatively more emphasis on self-actualization and 
autonomy needs than did lower-level managers. This raises the extremely 
interesting problem of whether the managers achieved these higher-level 
positions because of stronger self-actualization and autonomy needs or 


whether the intensity of these needs is a product of socialization. Porter 
(1963b) examined the difference 


line and staff managers as provide 
was more important to staff man 


of management 
findings sugges 


e patterns of aff, 
y Porter (1963d) indicated 


- In terms of a number of 
Job experiences, large-company 
ent "affective Profile" than small-company 
onse profile of large-company managers was 


: intense, complex, profound, challenging, and 
h € attitudes of small-company managers with 
job experi vere “bori 


informal, superficial 8: bad, easy, relaxed, simple, 


: ; and c ive." thing is 
clear from this study aur crue" Ong į 


: the socialization ro i i a 
N H " Vio 
quite different and Produce quite differ per eee = 
(1962) findi g 


> 
rent social learnings. Vroom’s 
of ego-involvement in 


29 . They indicated that the amount 
: : a person's job functions and the resulting satisfac- 
tion with self, others, and job were related to the amount of self- 
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problems. Their findings suggested that high grievance feelings and job 
turnover were associated with organizational environments in which 
arrangements for leadership were highly structured and little considera- 
tion was shown for the problems of employees. A study by Argyris (1962) 
which examined the interpersonal competence of executives, tended tö 
support the findings of Fleishman and Harris. Both Yanouzas (1964) and 
Argyris (1964) produced works which indicated that the formal work 
structure in which people find themselves acts as a strong influence upon 
the kinds of affective attitudes they will develop about their work situa- 
tions. It is clear, also, that when the work structure is highly planned 
and produces a depersonalized psychological environment, behavior which 
is detrimental to both the people and the organization begins to emerge. 


Social Skills 


Houton (1963) suggested that adults learn actively and consciously 
to manipulate their relations with others when sociopsychological en- 
vironments are unstable and subject to rapid change. In a study of the 
effect of school size upon participation in extracurricular activities, Barker 
and Hall (1964) stated, “We conclude that the Seniors of the small 
schools reported more extra-curricular activities than the Seniors of the 
large schools by a factor of over two to one.” Since the over-all research 
project from which this report derived its findings involved some 38,172 
high school students, the findings of the study cannot be viewed lightly 
in terms of the possibilities that the socialization that takes place in 
different-sized schools may possibly have some effect upon the kind of 
participation skills and motivations that are learned. Gump and Friesen 
(1964a, b) examined the amount of participation and degree of satis- 
faction that high school students of different-sized schools experience. 
In the latter report, they stated that “Juniors from the small schools re- 
ported more satisfactions relating to competence development, to chal- 
lenge and big action, to group action, to ‘uplift’ values; large school Juniors 
reported more satisfactions dealing with vicarious enjoyment, with large 
entity affiliation, with learning about their school’s persons and affairs and 
with gaining ‘points’ via participation.” This shows clearly that the social 
skills being acquired through socialization in the different-sized socio- 
psychological settings were quite different; one wonders which set of 
skills will prove best for the welfare of American society. Campbell's 
(1964) study of the effects of high school consolidations upon the behavior 
students acquire through socialization showed that the social skills and 
motivations of students underwent a change with transfer from a small 
local high school to a large countywide high school. Students experienced 
a decrease in personal responsibility associated with extracurricular activi- 
ties and a decrease in a number of satisfactions associated with physical 
well-being and with development of intellectual interests, self-concept, and 
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zest for living. This study not only provided an insight into the ice 
in the kind of social learning that takes place in different pu G : T de: 
settings but also, in light of some of the social and health pgi em pon 
American society, raises the question of whether many of these E 
that have evolved without much examination are developing a rd 
ings which leave adults behaviorally incapable of dealing with family. 


> e and 
community, and work problems and consequently result in family and 
personal breakdowns. 


Values 


In a study dealing with the r 
and the life-work values considered 
Boggs (1963) discovered that tech 
what reverse set of values w 
should derive from work. Tec 


elationships between job classification 
most important by laboratory workers, 
nicians and professionals had a some- 
ith respect to the kinds of rewards one 
hnicians placed heavy emphasis upon such 


groups had some interest in oppor- 
alue seemed to be much more im- 
ho valued it much more highly than pay 
there is a question concerning whether these 
values are the result of socialization taking place in the laboratory or 
elopment from some previous 
n the kinds of job classification 
: it is rather clear that these 
ing, socialization that results in 
alues about life work. 
» and Baxter (1962) conducted a study that por- 
e of values acquired by insurance salesmen through 
findings indicated that insurance salesmen have 
hat even though they are 
rough which they receive 
eir occupation had greater 
d high or low gratification 


Youngberg, Hedberg 
trayed the significanc 
socialization. Their 


extrinsic rewards, 


power for determining wheth 
from their work, 


gests that adult socialization is highly 


erhaps it should be one o 
a y and research development; nothing woul 
seem more important for the development and survival of the American 


Society than the formulation of knowledge that would enable people to 
perceive accurately the kinds of social lear 


ning being acquired by 
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American adults. Lack of this kind of knowledge ultimately led to the 
downfall of many of the influential societies of the past. It would seem, also, 
that the ability of Americans to help other countries depends largely 
upon their ability to understand and perceive accurately the nature of 
the socialization processes operating in these countries. Regardless of 
the political, cultural, and economic stature of any society, its ability 
to develop, maintain, and perpetuate itself depends upon the kinds of 
social learnings that are acquired by its members. 
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CHAPTER VI 


The Group as a Sociopsychological Setting 
for Learning 


ROBERT D. BOYD 


Over the past decade, concern has increased for the interrelationships 
among variables in small-group research. The need for explanation of the 
interplay of structures and processes in the dynamics of small groups has 
long been appreciated (Homans, 1950). There has been much evidence in 
recent years of the growing sophistication of researchers and research 
methods in being able to handle multidimensional aspects of small-group 
research (Hare, 1962; Golembiewski, 1962). The increased ability to 
account for variability of findings as a function of identified interacting 
variables has direct application to education. In the educational setting 
there is little control of the many variables operating in the learning situa- 
tion; thus, it is necessary to understand their dynamic relationships. 

In the theoretical phase, model building may offer direction to the solu- 
tion of the problem of interacting variables. Three types of models hold 
exciting possibilities for untangling and giving order to the mass of 
accumulated data on small-group research: (a) the general systems theory, 
(b) the mathematical model, and (c) the electronic digital computer model. 
Noteworthy exemplars of the general systems theory are the works of 
Parsons, Bales, and Shils (1953), Homans (1950, 1961), Thelen (1959), 
and Heider (1958). A report on mathematical models was presented by 
Criswell and others (1962). The merits and possibilities of the electronic 
digital computer model were presented by Hare (1961) and poe pao 
It is possible to combine the benefits of all three types; eg Gulla orn a 3 
Gullahorn (1963) combined certain eri propositions in Homans’ wor 
with nic digital computer model. . 

epe uu that models may make to the educator’s understanding 
and effective and efficient use of small groups are in the speculative sage. 
It is assumed that when educators are able to have a relatively ni "e 
comprehensive understanding of the dynamics of the interrelationships o 


variables, they will be able to best use the findings in zee any AEE 
Previous reviews of the literature appear to fragment the results rather 


than to give order and relationship to the findings. In an es to cor 
the findings of research, the author of this chapter organized the [gei 
into a simple paradigm. For the organization of the data on small-group 
research so that one may look at structure, process. and outcome rather 
than at isolated variables (concepts) such as leadership, a paradigm of 
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input, operation, and output may be profitably employed. Input contains 
those variables that are seen as the component factors upon which groups 
are composed, such as the number of members, the nature of the task, and 
the personality of the members. Operation contains chosen variables that 
are seen as developing through the life of the group, such as role differentia- 
tion, norms, and leadership. Output contains those variables that are the 
product of the group, such as productivity, visibility, and status. Generally, 
research can be placed within one phase of the paradigm. 


Input 


The relationship between personality 
much attention. The difficulty of predi 
from personality inventories remains 
of the difficulty of isolating and mea 
cause of the many variables in th 
variables, such as nature of the g 
alities in the group, 
(1962) did an ex 


and behavior in groups has received 
cting group behavior of individuals 
a difficult problem not only because 
suring personality traits but also be- 
group situation. Even the more obvious 
roup's task, leadership, and other person- 
indicate the complexity of the task of prediction. Hare 
celent review of the literature, Generally, central 
nality do persist in group situations. Powell (1962) 
ble to predict the ability of an individual to distinguish 


.- In8 personality studies within the pu 
Oup situati 
found that both member P situation. Haythorn and others (1956a, 


behavior and le dershi ied significantly 
between groups having high adership style nes 


ge when interact; : ies were 
used, They found that in ction process categories 


groups th us the older teachers talked more 
g © Younger teachers talk d T (1963) 
found a greater f à SEHE ed more. Toman 

nd a greater frequency of conflict by sib] 8 position in spouses where 


; 59), in his study of triad 
ences between sexes which 
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led him to conclude that the sexes differ significantly on certain behavior 
patterns in groups. He identified several of these based on a condition 
game problem he employed with triads. Borgatta and Stimson (1963) 
found differences between sexes on interaction process scores. They con- 
cluded that differences between males and females are of two types: (a) 
some variation in the factorial structure and (b) differences in amount of 
given categories of behavior. Subsequent papers. by Uesugi and Vinacke 
(1963) and by Vinacke (1964), substantiated the existence of significant 
differences in behavior patterns between males and females in small groups. 
Earlier research by Strodtbeck and Mann (1956) showed some differences 
between male and female behavior on the interaction process categories. 
They stressed that such differences should not obscure the determinative 
influence of the problem situation. 

One of the crucial considerations in the input system appears to be the 
task. In large part, much of the reward system is directly or indirectly tied 
to the nature and demands of the task. The definition of a task is the 
situation. Hoffman and Smith (1960) 
emphasized that the nature of problems used in research studies may 
directly elicit the role differentiations that are observed. They stated that 
each problem used in their study appeared to elicit certain types of reactions 
from group members, and these reactions differed from problem to prob- 


lem. Hoffman and Smith concluded that personal attributes of individuals 
amined after considerations of problem require- 


in a group should be ex 
ments and group characteristics. Zander, Natsoulas, and Thomas (1960), 
s goals for its members, found 


in a study of personal goals and the group" l 
that members who set their levels of aspiration more congruently with the 
group goals showed greater involvement in their group than did the mem- 
bers who set their levels of aspiration less congruently with the group goal. 
Such results as these have many implications for education. It is imperative 
that the nature of the task be clearly perceived and modified if necessary to 
be congruent with the educational objectives being sought. i — 
Size of groups has concerned many researchers. Research resu : 

theory have related size to such variables as resources of group mem x 
(Thelen, 1949), nature of task (Shaw and Blum, 1964). sex roles (Vinac e. 
1959), aid satisfaction (Thomas and Fink, 1963) . Initial work in this area 
established that satisfaction decreases as number of members iieri 
As one would expect, the number of interactions per member deer E it 
increase in size of group. and the rate of giving information an sugges ions 
increases as size increases. Recent work indicates that optimal group size 
appears to be five members (Slater, 1955). Thomas and peat chee re- 
viewed 31 empirical studies related to group size. They reporte th at many 
factors, such as distribution of participation and group organization, were 
found to be significantly affected by group size. They stated that methodo- 
logical shortcomings prevented more inclusive generalizations. Notwith- 
standing. one may conclude that size is an important input variable. 
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Operation 


i lysis 

Communication as one dimension of -he operation fiue ee ai 

aradigm encompasses such factors as rate and type o em poem rien 

€ feedback, and control. Mitnick and McGinnies (19 ) Poe rs 
the most active participants in discussions (on ethnocentrism) le 


most. A well-documented and integrated discussion of the researc 


among group members, The slow performers were 
somewhat more sensitive to the difficulty of th. 


sic idea of congruency 
e field as Cooley (1902), Mead 
k on the concept of congruency by 
t much of an individual's behavior 
ttempting to secure congruency in 


operation. They stated that when 
ed as obtained, a 


Y workers in th 
€cent wor 
howed tha 


(b) misrepresenting the other’s 


necessary and the areas in which it must be expressed. Some work was done 
212 


June 1965 Tue Group as A SOCIOPSYCHOLOGICAL SETTING 


on this question by Backman and Secord (1962). They found that the 
greater was the interaction, the greater was the likelihood for perceived 
congruency. Their data showed that liking and interacting are almost in- 
dependent dynamics (r=0.18) when related to congruency. They sug- 
gested that generalized liking may overcome a state of specific 
incongruency. 

Shaw and Penrod (1962) examined the nature of the relationship be- 
tween amount of information and group effectiveness. They were able to 
show that information is linked to the recipient's attitude toward the infor- 
mation and the source of information. If the provider could validate or 
document the information, it was accepted. Such information was linearly 
related to influence and effectiveness. Individuals possessing influence 
based on validated information could give moderate additional information 
which resulted in increased influence. Too much additional unsubstantiated 
information decreased an individual’s influence. 

Homans (1961) stressed the importance of the reward system as a deter- 
minant of behavior. Leaders have been identified as one source of rewards. 
McClintock (1963) found that leaders make more positive affect responses 
(rewards) than do other members of a group. Support appears to facilitate 
interaction among those who generally find it difficult to enter into inter- 
action; in this connection, Shaw (1960) demonstrated that a greater share 
in the group task leads to greater task motivation of group members. Miller 
and Hamblin (1963) reported that under conditions of high task inter- 
dependence a strong negative relation exists between the productivity of a 
social system and the degree of differentiated rewarding for relative achieve- 
ment. Miller and Hamblin made a significant contribution to research by 
demonstrating that the cause of conflicting results on research dealing with 
cooperative and competitive tasks was due to lack of independent control 
of the differentiated reward and the interdependency of work relationship 
on the task. Rosenberg (1960) showed that a member does not change is 
behavor significantly unless the reward and punishment system is coor- 
dinated to his behavior. Reward alone appears to be insufficient. Zajonc s 
(1962) work showed that reaction time of experimental teams was inhibited 
when members received only knowledge of the group’s success or failure 
and were given no feedback on individual performance. It may be that en 
bers not only have to receive rewards of an achievement type but ^ nee 
to check their perceptions with others. The latter, of course, could be con- 
sidered another form of reward. D. . 

Leaders may be part of the input system, but leadership is definitely an 
operation phenomenon. Fiedler (1962), reporting his findings regarding 
leader attitudes, group climate, and group creativity, reached the following 
conclusions: (a) leaders who perceive their least-preferred co-workers 
favorably tend to be most effective under pleasant and relaxed group con- 
ditions, while leaders who perceive least-preferred co-workers unfavorably 
are most effective under unpleasant, stressful group climates; (b) group 


213 


(XXV, No. 3 
Review oF EDUCATIONAL RESEARCH Vol. X XX 


a m T 
creativity requires € nm ele od pmo oar 
ion- directive leadership in more s + un] 
ee pr stated that leaders made fewer task-oriented um 
more positive affect e a fewer oo pi caine a 
itions than did other mem j 

cma gs is make significantly more task-oriented si em 
than other members was not supported by this research; ima e 
gested some very intriguing and possible explanations for this resu : aci 
(1962) suggested that the best rule for selection of leaders is to se ran im 
individual who has the necessary traits to satisfy the functions neede ii 
the group. Bass (1960) gave a comprehensive and worthwhile discussion 

leadership as a conceptual framework. - 

Considerable attention in the literature on small-group research has bee 


given to the topic of roles. Findings of Hoffman and Smith (1960) appear 
s challenge the results of earlier Work in role di 
Slater (1955), who found that gr 


€ to a point, but as the 
tiation show 


» of social control by H 
Cartwright and Zander (1960). ertainly, the central objective in the 
employment of the instructi 


search by Kiesler (1963) 


Output contains such outcomes as Y; reward, satisfaction, visi- 
bility, and status, Although the 


study by Mann, Indik, and Vroom (1963) 


ie» Witerpret (Collins and Gnetskow. 1964]. 
Homans (1961) presented a discussi ds which has merit for the 
instructional group. In an investigation of visibility, Marwell (1963) con- 
cluded that there is some gen i 
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function of visibility and status have not been extensively explored in the 
instructional group. 


Summary 


The author of this chapter has attempted to report research that pertains 
to basic conceptual threads that have been woven into the study of the small, 
face-to-face instructional group. A paradigm was proposed to account for 
the dynamic relationships among variables, by structuring input. operation, 
and output phases. Within each of these phases were reported a selected 
group of recent research studies and new books in the field. The author’s 
intent was to highlight some of the recent advances in small-group research 
as such research applies to the adult instruction group. 
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CHAPTER VII 


Materials, Methods, and Programs for 
Literacy Education 


ROBERT F. BARNES 


E E rt, 
bulk of the research conducted and reported in this field. For the most Él 
the research conducted thus far has been done by persons directly invo din 
with adult literacy education; very little has been conducted by pers 


Materials 


Developmental 


Publishers Weekly (1964) reported the findings of a U.S, Office of Edu- 
cation task force in à review and appraisal of existing instructional mate- 
rials for adult literacy education. The task force found “a serious shortage 
of materials for teaching basic reading skills, especially word p ant 
: ressed a Suggested basic vocabulary instead o 


y already Possessed by adults. The materials were 


lans and evening school principals 
t what grade levels of readability 1s 
reading materials for adults?” The 
frequently, These librarians 
ip, homemaking, and family 
acute needs. The validity of 


Hd puso: > depends a great deal upon 
reading ability, Principals Worked with adults of low 


» asking them, “A 
there the greatest shortage of suitable 
upper elementary levels were me 


» Which are som 


he factors influencing the 
es of personal health one 
lecti reading ability. The number ie 

i Select " 
ebay associated with difficult Ton Was found to be the factor mos 


- Next in line was the 


June 1965 LirERACY EDUCATION 


Field Testing 


To test the basic fundamentals of reading and spelling, Levi (1964) 
used 204: subjects from Chicago, 128 of whom were adults with an average 
schooling of 5.8 years. After 99 hours of prescribed instruction, the mean 
difference between pretest and posttest scores of these adults was 4.6, with 
a standard deviation of 0.65. These figures must be interpreted cautiously, 
however, since it is not exactly clear what this gain represents. Two of the 
tests used had norms based on elementary school populations, and no effort 
was made to utilize a control group. 

A more structured experiment was conducted by Henney (1964b), who 
used the Family Phonics System with 30 inmates of a penal institution. 
These subjects were divided into three groups: a control group, an experi- 
mental group that received one-to-one tutorial instruction, and an experi- 
mental group that received group instruction. He found that 20 one-hour 
sessions increased reading ability by as much as 2.5 grade levels for the 
functionally illiterate inmates and that it made little difference whether 
the teaching-learning situation was on a group or on a one-to-one basis. 

Another experiment conducted in a reformatory was directed by Allen 
(1961). The Laubach literacy films were used for instruction of 288 
inmates, 108 of whom graduated from the program. The showing of the 
films required 49 hours, and extra sessions were used for classroom 
instruction, review, practice, and drill. The subjects had a claimed median 
educational experience of 5.8 years but a mean measured educational 
achievement score of 1.97. Those completing the course increased an 
average of 2.5 grades in reading, vocabulary, and spelling. The results 
must be viewed with caution since the sampling technique used was uniden- 
tified, no control group was used for comparison, and the test instruments 
had norms based on school children. Also, the results from the two previous 
field test efforts cannot be generalized because of the confined population. 

Henney (1964a) again tested the Family Phonics System and apparently 
did not limit it this time to a penal institution setting. The improvement 
in scores ranged from 3.5 to 7.4 grade levels after 160 hours of instruction, 
and he reported a significant difference between the gains in reading ability 
of the members of the experimental groups and the gains of the control 
Stoup. The results of this field test can be questioned because of the follow. 
ng limitations: the test instruments, the lack of a description of the control 
Sroup, the apparent absence of any attempt to select a random sample, and 
wm absence of any effort to match the experimental and control 

Additional References: Cass (1960) ; Moore and Hendrickson (1958); 

eijs (1961); Ward and Brice (1961). 


Methods 


The interim report of the Norfolk experiment directed by Cooper (1964) 
ates the use of team teaching with four closely matched groups of “hard- 
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core unemployed,” the majority of whom did not pass the General Aptitude 
Test Battery. 'The main experimental group was given intensive instnichon 
both in general education and in technical training; the subsidiary pee 
mental group received only technical training. The main control group w Hs 
tested only at the beginning and end of the project, while the subsidiary 
control group received guidance and occupational information on a bi- 
monthly basis to test the Hawthorne effect. The intermediate results re- 
ported by Brooks (1964) showed that 90 of the 100 men in the two 
experimental groups graduated, and all of the graduates were placed on 
jobs, most in their respective fields, 

Peerson (1961) directed an experiment that involved televised instruc- 
tion of illiterates with Laubach’s Streamlined English and supplementary 
materials. Classes utilizing direct teaching were organized for those areas 
having poor television reception. One-fifth of the group of 608 subjects had 
a minimum of five years of schooling, and one-fifth had had no previous 
s two or three years. After approxi- 
on and review sessions, the groups 
are these adults with a national norm 


1 8roup was composed of interested home 
viewers and students enrolled in 3] Memphis centers. The average grade 
level achievement Scores were at the 2.5 grade level or above, with the total 
average achievement at 2.8, However, no pretest scores were used, and a 
escription of the sample was not given. The programming method was 
tried with adult illiterates by Crohn (1964), Intermediate data indicated a 


n "PT . H H H H i 
eed for further training in visual discrimination and an increase in the 
use of context as stimuli support, 


As a result of a 1955 survey of 30 Programs, Cass (1956) identified some 
common problems of administrators, Successful methods and techniques; 


training and recruitment practices, and a general overview of the structural 
organization of basic adult education programs 
g 3 


Additional References; Hilliard (1963) ; Olson (1963) 


Since very little knowledg 
of the uneducated gro 

Bray (1953) chose a representative sample of 400 from the 303,000 
illiterates received by the Army for training in basic literacy subjects. The 
typical trainee came from à community of under 5,000 persons, had com- 
pleted more than four years of schooling, was single and self-supporting 
and gave no evidence of having ever broken the law Of the 400 selected. 
343 graduated from this basic program. The evaluating criteria emphasized 
220 
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pic ec type of discharge, amount of time lost for medical and 
pain t y rhe and time of discharge (before or at the normal 
mete. = of his period of service). Of 331 graduates, 38 percent were 
eel SE s very good in performance, 49 percent performed acceptably, 
P esae pos were considered a loss to the Army. In addition, about 
ed pe of these graduates applied to the Veterans Administration for 
a benefits after leaving the service. 

Paces D uui was interested in the relationship between vocabulary and 
vind w xni is constant. He administered several tests to 156 
vea iers whose mean IQ for all eight reading levels was 77, 
ae in 75-79. He found that literacy and vocabulary were somewhat 
alc a of one upon the other appears to operate in 
oun oe directed a study to determine the literacy level of people 
oan pi gio aid even though physically able to work. The sample 
ead o 0 adults on relief, most of whom were Negro women from 
Fia à with a median grade completion of 9.5 years. Only 6.6 
ua on not completed five years of schooling. However, when a 
i rdized test was administered to each and the scores were compiled, 
: .7 percent of the total sample scored below or at the functionally literate 
evel (fifth grade reading ability). 

iu References: Altus (1950); DeGabriele (1961); Hertert 


Conclusions 


Pm the lack of basic and applied research in adult literacy education, 
n rom the tremendous emphasis now being placed upon this aspect of 
education, it is apparent that without a great deal of further sound research 
this field will continue to be little more than the present conglomeration 
of hit-or-miss programs. Well-designed research is badly needed in such 
related areas as sociology and anthropology so that teachers and adminis- 
trators in the field may better understand the illiterate adult, his environ- 
ment, and his society. Research is needed, too, in areas such as learning, 
motivation, teaching methods and techniques, and teacher training 
techniques. 

The reviews of field testing that ha 
Up a need for a sound design strateg 
tional materials. Instructional materials 
needs of the adult illiterate. Finally, it appears t 
identify accurately the functionally illiterate segment o 


ve been completed and reported point 
y for future field testing of instruc- 
must be developed to meet the 
hat there is a real need to 
f the population. 
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CHAPTER VIII 


Procedures and Methods 
for Community and Resource Development 


JOSEPH L. MATTHEWS and LINNEA B. HOLLAND 


If the aim of education is to bring about a change in the behavior of 
the learner, then adult education is one method of changing ene 
in line with the objectives of the community development effort. It x 
lows also that the total of community development is more than adult 
education and it functions in the context of social, economic, and physi- 
onducted or evaluated in isolation from 
ide whether community development is 
4) wrote that it is often difficult to see 
h of the activity called community de- 
l method is involved to the extent that 
help the people of the community learn 


or is not education, Miller (196 
any educational process in muc 
velopment. However, educationa 
the worker in the field tries to 
problem-solving processes, 

The limited amount 


of research on th 
is probably a result of 


© community development process 


of the research problem, the paucity 
of resources for this kir and the prevalence of adult educators 
who do not feel they need to develop a theoretical base for their programs. 

ality of research varies widely, 


Fliegel (1964) y 
development setting 


their social life; (b) 


committed or a cause to which they are dedicated, 


Leadership and Training 


Several studies attem 
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Brown (1963) found that women leaders were well educated. knowl- 
edgeable about adult education, and willing to help increase educational 
opportunities for women in their communities. Studying influential Negroes 
and Negro organizations, Barth and Abu-Laban (1959) found that the 
Negro subcommunity lacked large-scale business and industrial organiza- 
tions and that Negro leaders were primarily concerned with issues of 
an interracial nature. Auerbach (1961) and Mulford (1962) noted that 
the “power elite” formed a cohesive social group and interacted more with 
fellow leaders than with other community members. Danzig (1959) 
identified 11 motivations which were acknowledged as conscious desires for 
the assumption of leadership positions. 

While training of community leaders would seem to be rather crucial, 
there was little genuine research in this area. Franklin (1959), working 
with 14 university consultants, compared three training methods and 
found three major obstacles to better training: (a) lack of interest in 
or need for training expressed by the leaders; (b) tendency to study things 
and projects rather than people and processes; and (c) antidemocratic 
values and rigid perceptions held by the trainees. Devereaux (1960) noted 
a relationship between community leadership, socioeconomic status, and 
certain other correlates of education, occupation, prestige, values, and 
attitudes. Vested interests and a proliferation of roots locally increased 


leadership involvement. 


Group Behavior 


group structure of both rural and 


An understanding of the changing n 
engaged in 


urban communities is a necessity for the adult educator i 
Lindstrom (1961) studied interest groups 1n 


d found that most had sociocultural 
rgroup cooperation on the com- 
attempted to determine causes 


community development. 
which farm people participated an 
linkages. He stressed the need for inte 
munity level. Alpert and others (1960) o dues d 
behind the apparent lack of community group participation bo ie 
high-rise housing projects, and Young and Mayo (1959) studie Foe 
fest and latent participants in a rural community action program. . p 
found that residents with low income and recipients of public assistance 
tended to participate less than did persons from ee gom 
Although willing to get together to solve existing problems, they 

e one existing community action group 


not willing to participate in th ; 
i Di 

because it had no recognition or authority from the project management. 

und socioeconomic status closely 


While Young and Mayo also fo omic : e 
"PEL b 

n ticipation, they felt that inactivity in communi 
related to level of particip: able attitudes or even apathy, 


: i rily t unfavor 
programs did not necessarily reflect uni PACA "4 
but demonstrated a lack of understanding or knowledge of the organiza 


tion. A similar finding was expressed by Folk (1962), who studied 
organized community groups and their impact on the schools. Although 
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groups were interested in bringing about improvements in the educa- 
tional program, they depended on the school to furnish leadership. Davis 
(1960) found that citizens did not have the information they needed to 
make rational decisions about educational matters for which they were 
responsible and that positive relationships were found between public 
knowledge and attitudes. 


Professional Aspects 


What is the function of the 
What type of education or train 
on the professional community 
form of conjecture or opinion 
Several studies were made by 
(1963) concluded that (a) urba 
in sociology than their rural c 
considerable training in comm 


professional in community development? 
ing does he need? Much has been written 
leader, but most of it has been in the 
z Very little has been based on research. 
cooperative extension personnel. Kalangi 
n extension agents required more training 
ounterparts and (b) both groups needed 
unity development, public speaking. and 
(1961) concluded that those extension 
Y administration. on the basis of their 
equally successful in a broadened com- 
ined and given time to make the role 
I à ‘ 
itelemdensis adel cit perative extension, Dickson (1959) found that 
citizen groups in (a) studying community resources 


; ble to it, (b) identify; ing 
community pm x >. entifying and evaluating 
y problems, (c) making decisions about community goals and 


objectives i : 
J , (d) developing plans of action to achieve goals and objectives, 


(e) organizing citi s 

Ò 8 citizens to expedite " 

^ act S . 
progress and accomplishments ton on plans, and (f) evaluating 


and outside resources availal 


Social Action and Change 


measurement of socia] change. A hel f 

Rogers (1960) established be S ps 
developed hypotheses to be tes 
and Hardee (1963) 


dy by Andrews, Bauder, and 
rural industrialization and 


year studies in which sub- 
grams. The per- 
e in influencing part-time 
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involvement. The introduction of the program from the outside eliminated 
the suspicion that someone was trying to gain a vested interest. and the 
award became the primary group action goal. 

One of the better studies of social action was done by Sanders (1961). 
who made a case analysis of community controversy on the issue of 
fluoridation and described a series of stages which led to community 
decision. The stages included (a) initiation. (b) preproposal. (c) pro- 
posal, (d) community action. (e) decision, and (f) aftermath. Sanders 
found that the professional initiators (e.g., dentists, public health officers) 
were usually unprepared for the political leadership required in the 


community action stage. 


Attitudes, Views, and Beliefs 


The desire to improve or to develop resources depends to some extent 


on the attitudes, views, and beliefs prevalent within the community. 


Folkman (1962) noted that chronic low income was more closely related 
to the system of attitudes and values than to limited employment capa- 
bilities. while Burman (1959) found interest in intellectual. development 
virtually nonexistent among adults on lower socioeconomic levels, although 
interest in vocational training was evident. Press and Hein (1962) found 
that the need for orderly development in a rural Michigan township 
adjacent to a metropolitan center was considered to be the least urgent 
by the key group (commercial farmers) that had the power and organiza- 
tion to initiate action. Porter and Lyon (1963) studied attitudes, opinions, 
and levels of understanding of cooperative extension staffs regarding 
al areas. The staffs considered (a) the creation of public 
program and (b) the development of 


development of rur 
awareness and understanding of the 
local leadership as the two most serious obstacles. 


Procedures 


to the problem of useful procedures. 


Some attention has been given proc 
a theory of motivation for 


Beers (1959) presented 13 propositions in 
community organization. JN . " . 

Community development clubs have been initiated in severa ocations. 
o were studied rather thoroughly by 


Those in Alcorn County. Mississippi. $ 
Baird and Bailey (1960) and again by Cole and Kaufman (1961). Two 


distinct stages in the growth of community clubs were ane 
Baird and Bailey; variables affecting their success were isolated by Cole 


and Kaufman, who found that the presence or absence of such groups 
pend more on a relatively high socio- 


amilies. A long history of interaction 
1 to subsequent growth and success 


within a community seemed to de 
economic level than on numbers of f. 
within a neighborhood was closely related 
of community development clubs. 
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Community Resources and Needs 


How can needs be determined? How do communities perceive their own 
needs? What resources are available for meeting needs? Several research- 
ers attempted to find answers to these questions. Aiken (1963) designed 
a model for assessment or critical analysis of community situations. 
Gambill (1961), using seven different survey techniques, successfully 
designed and tested a method for determining needed community services 
in Georgia. Public affairs interests were studied by Brown (1961) and 
by Hazlitt, Boone, and Matthews (1960). Concern was expressed in regard 
to industrial development, employment of 


pportunities, and school facilities 
in the survey made by Brown: while Hazlitt, Boone, and Matthews found 
about three-fourths of the respondent: 


s interested mainly in public affairs 
and other liberal subjects and in 


more opportunities for this kind of 
adult education. Community facilities and resources were also studied by 
Williamson (1958), McKinney and others (1961), and Douglas and others 


(1963). While the focus varied from study to study, it was generally found 
that many small communities lacked both financial resources and demands 


for nonvocational adult education and that local government was perceived 
as being inefficient or too small or both, 


E involvement. in a community study or project have any lasting 
ee ape cs Three Separate studies indicate that it does. 
can ound that active student icipation j i 
ie we ae tA e participation in organized college 
eani Y related to subsequent involvement in community 
ganizations. Both Mitchell (1960) and Hollenbeck (1959) found 


increased interest į iti i 
t ater citizen participation following 


rlier involv : P. 
pes i ee community survey or study programs. Caldwell 
: ose n ing the effect of a university- 


qo and, in assessi 
Sponsored develo 
P! pment s at many People were not well 


0 à tudy, found th 
"A mp p oa public school Program. Because of only partial success 
in the acti 

action phase portant recommendations of the study 


, the most imr 

group were not implemented hil 

2 QW lle some of t A 

accomplished. he less important ones were 
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CHAPTER IX 


Clientele Analysis 


ALAN B. KNOX 


During the past decade, adult education research and studies categorized 
as clientele analyses became a major segment of all research in the field. 
A clientele analysis is a description of the characteristics of the participants 
in adult education programs of one or more agencies in comparison with 
the characteristics of the general population of adults who could potentially 
be served. The characteristics frequently used for comparison include age 
distribution, proportion of men, level of education, program interests, 
marital status, and occupation. The participating adults were often com- 
pared with the total potential constituency, as defined by the agency in 
terms of geographic area and eligibility requirements. Three approaches 
have been made to clientele analysis. One approach compares the charac- 
teristics of adult education participants in all agencies with nonparticipants 
on a national or regional basis. The second approach analyzes the char- 
acteristics of participants within a type of agency, such as a school or 
university or library. The third approach describes the participants in one 
agency in a specific community. Studies using the third approach have 
been most numerous and, in general, methodologically least rigorous. 


However, they have served several important functions: (a) confirmation 
of more general research findings, (b) indication of variability within a 
type of agency, and (c) direct contribution of the findings to administra- 
tive decision making within the agency in which the clientele analysis was 
conducted. . 

Clientele analyses are an important segment of adult education research 
for these reasons: (a) since so many modest clientele analyses have been 
conducted, a review of the most rigorous clientele analysis research can 


encourage more effective studies; (b) clientele analysis findings can m. 
cate the degree to which target audiences are being reached: P es e 
analyses can provide evidence for more eflective counseling and teaching 
of the adults being served and for more eflective program development 
and promotion to reach other adults; (d) clientele analyses can encourage 
greater interagency coordination by indicating how each agency can best 
complement the other's activities in terms of audience reached ; (e) clien- 
tele analyses can contribute to theory related to sociology of education. The 
voluntary nature of most adult education participation allows an examina- 
tion of relationships between characteristics of agencies (such as image, 
subject matter, educational method, and promotion) and characteristics of 


the client system that is attracted. 
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Characteristics of Adult Education Clientele 


(1963) and a citywide study by London, Wenkert, and Hagstrom (1963) 
provided supplementary information. The integrative reviews of prior 
research on clientele analysis by Verner and Newberry (1958) and the 


characteristics of adult education participants differed 
participants. Because the most detailed and representative data were 
provided in the National Opinion Research Center (NORC) study by 
7 ed as the primary 
sed for elaboration 


tive national sample of 24,000 adult 
ticipation, as a part-time student. 
form of instruction of which th 


5. Adult education was defined as par- 
> in an activity organized around some 


e main purpose was to learn or acquire 
Some sort of knowledge, information, or skill. The purpose of the study 


was to determine who Participated in what types of adult education pro- 
grams and for what reasons. The data from this well-designed study have 


detailed and accurate analysis to date of the adult 


: 8y restricted, Data Were based on a 
community Survey of 2 70g interviews a 
sample Survey of 203 adult e i 


of this Baseline Study of 


e > and research procedures. 

g aphic variables in the three studies were reported 
Provided a basis for testing 
Brunner and others (1959) 
mal education, higher status 
Tager than nonparticipants. 


er for rural t an for urban residents and 
dwellers, 


1 “pants had more for 
Occupations, and more income and were yo 


he rate of Participation was low 
was especially low for small-town 
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Table 1 presents data that indicate the tenability of the earlier generali- 
zations regarding the general clientele of all adult education. The best 
current estimate is that 15 percent of the adult population participated in 
some agency-sponsored adult education program during the past 12 months. 
There appeared to be no substantial difference between men and women 
or between married and single persons in gross rate of participation. The 
data from the studies by Johnstone and by London, Wenkert, and Hag- 
strom indicated that the participation rate declined with age. However, the 
life-cycle analysis by Knox and Videbeck demonstrated that when variables 
such as level of education and life circumstances were accounted for, most 
of the age-related differences disappeared. The positive relationship between 
adult education participation and level of occupation, income, and educa- 
tion was demonstrated by all three studies; however, the highest degree of 
association was with level of formal education. Both the Johnstone and 
the Knox and Videbeck studies showed a positive relationship between 
community size and rate of adult education participation. 


Agency Differences in Clientele 


As was pointed out in the chapter on participation by Brunner and others 
(1959), there is an interrelationship between the clientele of an adult edu- 
cation agency and agency offerings. Johnstone (1965) demonstrated that 
different types of agencies offering comparable adult education programs 
attracted different audiences. : 

Table 2 indicates the proportion of all adult education sponsored by 
each type of agency, as classified by Johnstone, and the major aje 
matter areas offered by each type of agency. Based on the Johnstone study, 
21 percent of adult education was sponsored by colleges and ee 
12 percent by schools, and 4 percent by the military. The somewhat dif- 
ferent distribution in the London, Wenkert, and Hagstrom study seemed 
to be associated with the all-male sample. The distribution of sponsorship 
in the Knox and Videbeck study was similar to that in the Johnstone study. 
The generalization that adult education was primarily ie em pe re- 
lated was clearly supported. For all sponsors, except s an xum 
munity organizations, courses classified in the vocational subject m. er 
area constituted a major portion of the offerings. In general, the egree 
of agreement among the three studies regarding the major a matter 
areas offered by the various types of agency was exceedingly m 1 bui 

During the period between 1959 and 1964, the largest number of pub- 
lished clientele analyses was conducted in college and university adult 
education programs. Table 3 presents data regarding participants in college 
and university adult education programs, based on the Johnstone study, 
along with data from five other studies that indicate variation within this 
type of agency. 
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————— THOME 


Tanz 1. Percent of Adults Within Various Demographic Categories Who Had 
Participated in Any Adult Education Activity During a Twelve-Month Period 


Oakland (men) Nebraska 
NORC London, Wenkert, Knox and 
Demographic Categories Johnstone * and Hagstrom * Videbeck * 
Total Adults 20% ** 12% 12% 
Sex: E 
Men 21 12 12 
Women 19 = 13 
Age: E 
20-29 29 20 9 
ibe : : ; 
` 10 
50-59 16 7 m 
60 or over 14 = 1 
Marital Status: m m 
Married 21 
Single 21 = 5 
Occupational Type:t ü 
Professional 45 
Managerial 25 22 33 
Clerical-Sales 27 16 16 
Craftsmen-Foremen 26 16 15 
Operatives 13 6 13 
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Laborers 9 10 7 
10 6 
Income: 
$4,000 or under 12 
$4,000-6,999 2 11 = 
$7,000.9,999 a 12 = 
$10,000 or over 32 14 — 
15 - 
Education: — > 
8 Years or Less 6 
9-11 Years 15 5 6 
High School Graduation 24 8 8 
13-15 Years 36 10 Ll 
College Graduation 39 21 17 
17 Years or More 47 23 30 
is "M ee s 35 
Siti, er: 


* The percentage rate of icipati 
B participa 
(1965) ; London, Wenkert, and un. 


** Five Percent of ad, i 
ults in the YORC 
Fifteen percent " Study were en aged in ; r 
brska ti : 2d enrolled in agency-sponsored fiona run naopen qay 
gures include only those adults enrolled jn H +5 adina Em 
"Sponsored programs. 


mee ame 


ibus Ero) from data Presented by Johnstone 
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Taste 3. Percentage Distribution of College and University Adult Education 


Participants Based on Several Studies 
General NORC Wash. 


Adult — College Univ. Buffalo Illinois UCLA 
Popula- and National Evening Grad- — Exten. Liberal 
Characteristics tion Univ. — Survey College — uates sion Education 
Men 47% 57% 66% T7495 90% — 38% 
Age: 
Under 35 31 54 73 79 92 55% m 
35-54 41 39 25 19 8 40 41 
55 or over 28 7 2 2 TM 5 Man 
Education: 
Elementary 59 } 1 3 — 8 3 
Secondary 25 24 22 32 "NM 32 13 
College 16 75 77 65 a 60 84 
Married 76 — s 52 = 74 n 
Occupational 
Type: 
Professional 11 =e = 
Managerial 8 - E 2 8 56 66 
Clerical-Sales 22 Ma = = z 10 24 
Income: = = 
$1,999 or under 13 T" 
$2,000-4,999 29 in X 6 3 — = 
$5,000-7,999 32 = T 26 35 E - 
$8,000 or over 26 = 33 45 — = 
: = = 16 17 = = 
Organizational 
emberships ; 
None 25 - 
: T uM x "4 = — 21 
2 18 _ = s — — 20 
G 14 I" 3 -= — 13 
4 or more 21 S 8 = ze 21 
n = = 9 T 
Source: Beginning With the NORC es 26 


the following: Johnstone ( study (above), data are dr; a 
a d s 1965); Knox « re drawn respectively from 
A ; Zeman (1960) ; Carter, Kerr, and York En (1965) ; Wientge and DuBois 
Data from Johnstone (1965), ^: Hill (1960), 


Data regarding the remainin types genci 1 
2 ao, epean catory wh Wisin ne 
cial oe Dania analy iby D. i and a rare An oa 
libraries, Studies y "Vasinger (gi) Progam closely as Bs with 
i ae l wt adie ten peated 
x 9Anstone data for elementary and secondary 
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ini ' were from 
School programs. The data for the Temaining types of agency were 
g 2 
study by Johnstone. f : " 
E: dit every type of agency was reaching some adults in om 
every population category, but the modalities were quite different. 
college programs included the high 


i i ; ; ili- 
with the exception of programs by business and industry and by the m 


Proportion of young male i lir 
y of evening class students conducted by 


D 


g of education, occupa- 
tion, and organizational participati ere included in the clientele at- 
program described by Hill (1960) 
e lecture and discussion methods. 
ristics of adults attracted to non- 


ored by universities and libraries 
was pointed out in the studies by Hill (1960) and Day 


related to the audi 


£4 school were being reached by the 
ol Systems, and the 


d and unpublished ady 


It education clientele analyses, 
s and findings 
(1964), 


> Were prepared by Knox and Hall 
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which similar programs regarding subject matter and methods attracted 
the same clientele, regardless of agency sponsorship., was demonstrated in 
the case of noncredit liberal education study discussion programs. The 
effectiveness with which some programs within each type Uf ssency at- 
tracted some adults indicated that most adult education agenais could 
attract almost any target audience if appropriate approaches to program 
development and promotion were utilized. The studies reviewed i this 
chapter provided important references for use by adult educators in the 
design and interpretation of clientele analyses in their agencies. 
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CHAPTER X 


Needed Research 


BURTON w. KREITLOW 


to launch new research and to integrate that whi 
In Chapter V, Jensen and Schrader developed 
adult socialization; in Chapter VI i 
in organizing completed 


o 
research and can be used as a base for the theory and research development 


+ innovation, and adoption 


: atterns among farmers and 
n such other fields 


] ons based on innovation 
graphy. On the basis of reports by Rogers 
> and Lionberger (1961), it can be 


established 4 Structure of 
field to be made by scho] 
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much is being accomplished, but the additive qualities found in research 
based upon well-established models are still lacking. Promising examples 
of models are beginning to appear. such as those on processes by Verner 
(1962). In contrast, the limited research on adult literacy education so 
completely documented by Barnes in Chapter VII shows that researchers 
in this area have no existing or proposed model around which to develop or 
organize findings. 


Needed Research 


Identification of needed research reported by Kreitlow (1964) was based 
on a systematic review of recent literature in adult education and related 
applied fields and disciplines: interviews with 31 national leaders; and 
analysis of Adult Education: Outline of an Emerging Field of University 
Study by Jensen. Liveright, and Hallenbeck (1961). An early draft of 
a research conference called by the Adult 
Education Branch of the U.S. Office of Education. On the basis of the above 
and Chapters I-IX of this issue, the author has listed research needs in two 
sections: (a) priority needs and (b) other significant needs. Examples 
of basic and applied research and suggested development projects are 


presented for the first two priority needs. 


Kreitlow’s report was analyzed by 


Priority Research Needs 


Each of the priority research needs listed in Chapter X and shes 
gested in Chapters I-IX can be placed in an appropriate segment ^ gi 
low's schema. There are problems and research proposals made y F3 
overlap from one category to the next. It is up to the researcher to clarity 
the basis of his choice of category for a specific research problem. 


1. By what means can the educational, technical, scientific, and cultural 


. : H ;? 
illiterate be educated for economic and social self-sufficiency ¢ 


A. Basic Research. 
Determination of the concepts, 
in modern societ . : 
Determination : the goal and value base of those at different literacy 
levels 

Development of models for cha 
self and society around him. 


B. Applied Research. "rm 
A n ; k lev T 

ing of instr nal models with groups varied as to (C) 
Pon Aare c) and determination, through in- 


(goals and values level of living, etc.) ar B, He b 
plant studies of patterns of behavior required by automating industries. 
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skills, and abilities basic to self-sufficiency 


nging the jlliterate’s perception of him- 
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Comparison of adults of the same literacy level with variability in their 
degrees of self-sufficiency 


Investigation of instructional models aimed at developing a more positive 
attitude toward work. 


C. Development of Demonstration Projects. 

Development of demonstration projects jointly with industries under- 
going major technical changes to show the effect of a rising level of 
literacy on technical adaptability 
Demonstration of how groups affected by 


automation can be identified in 
advance and retrained on the basis of the 


ir identifiable needs. 


2. What are the most appropriate conditions for 
are the personal (internal) and social 
changed behavior in adults? 


adult learning? What 
(external) factors that lead to 


comparative studies (youth 
d learning environ- 
k skill attainment, and 
z Determine the factors in the life span 
of learning. Determine 
on the willingness of 
incidents that lead to 
adults known to have 
or. Determine variables of language usage 
related to achievement in skills, concept attainment, 


change by use of 
sharply altered their behavi 
that affect abilities 
etc. 


B. Determine how an adults leve] 
in learning activities, 
controlled conditions w i 
c ondi “the oe Vatled: for exam] hing 
eked a g > lor example, teaching 
cal skills in the factory, in the factory shops ir Wi echnical 
shodi. Study PS, in a public technica 


chang ` i 
social background St rier adult groups of different personal and 
g - Study the effectiveness of selected procedures for 


inducing i 
g change among groups with cial and/or personal 


characteristics, Identify variables in adult illiterates that relate 


x groups o 
to selected behavior changes, TE 


C. On the basis of what j 
adult learning, establish settin 


ditions are met 
promising procedures noted above in a total program of change on a 
community or Institutional base, i i 
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3. Under what conditions can a teacher be most effective in motivating 
and stimulating adults to accept change? What communication variables 
relate to acceptance of change? 

4. In what ways can the existing socioeconomic and cultural information 
be used effectively in planning and conducting programs of continued 
learning? 

5. What are the factors of sociocultural change that most influence 
learning? In what areas of human concern are educator-induced social 
actions useful? In what ways can positive action be stimulated? What is the 
nature of community dynamics: how can it be harnessed to influence 
learning? 

6. Who are the participants in adult education? What has been the 
quality and quantity of participation in the past? What is it likely to be 
in the future? 

7. What resources within the instructional group can be used to maxi- 
mize the achievement of individual and group objectives? 

8. What are effective means of achieving competency in one's life that is 
satisfactory to family. social group. community, vocation, and one's role as 


an individual? 


Other Research Needs of Significance 


ation in learning activities (indi- 


l. at fa rompt adult partici 
ih p : ‘hich these factors are 


vidual or group), and what are the means by w 
internalized? 
2. In what ways can knowledge of adult attitudes be used to change, 
f continued learning? ; 
dge of adult interests be used to ad 
ation in adult education programs: 
4. What are the characteristics of personality change in the adult gue 
what factors influence the change patterns: and in what ways can n - 
edge of these factors be applied in programs of continuing eic 
5. What are the conditions under which one learns to Low E gen 
educational ingredients encourage learning to learn E vas rov E 
the practice of continuous learning be developed e ipee E ded 
children and youth be helped to develop attitudes. habits, and s 


ducive to lifelong learning? u da sd " 
6. What factors are responsible for a positive anticipation of change? 
What characteristics of educational pr is elici : 
sponses from those with different change pur n pan ee “= 
7. What is the role of the educator in helping adults ma P mH B 
physical, and retraining adjustments arising as a consequence of changes i 
: ark? 
the labor force and the nature of work? ] 
8. What factors, principles. and philosophy underlie the response of 
adult education to changes in th 


develop, or guide programs o S 
3. In what ways can knowle 


quantity and quality of particip 


ograms elicit positive learning re- 


e social and cultural setting? 
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9. What purposes have been sought and achieved; what goals and pur- 
poses are now being sought; and what should programs of adult education 
be trying to achieve? i ] 

10. What should be the relationship between the immediate and long- 
term goals of adult education agencies? 

1l. What administrative structures and organizations are most con- 
ducive to the development and maintenance of effective adult education 
programs? 

12. What are the influences of differ 
terns and independent study on the ach 

13. In what ways can the potential r 
organized, integrated, and activated to 
growth? 


14. In what ways do learning experiences of the volunteer teacher 
(leader) add to the learning outcomes of educational programs using 


volunteer teachers (leaders) ? What personal and social factors influence 
the effectiveness of volunteer leaders? 


15. What are the means by whi 


ent adult group participation pat- 
ievement of selected goals? 

esources for continued learning be 
meet society’s need for intellectual 


skills, values, etc.) 
17. What is the pro 


tinuing educational challenges of adulthood? 
19. What aspects of the graduate curriculum in 
useful for leadership in this emerging field? 
20. What individual and/or 


adult education are most 


measuring outcomes of voluntary and/or forced participation in 
programs of continued learning? 


Conclusion 


need. Likewise, if results are ti 
for intercommunication are gre. 
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FOREWORD 


There has been marked and ra m in a pis eei dn higher 
i i cent years. Individuals, co eges and 'ersities, 
E uer diner ee cn, foundations, and TA 
local political units are seeking answers to the varied an ca = —— 
of higher education in um Oey society. The search expresse 
i speaking and writing. EM 
k e hands is a vital part o eee of profound transition in al- 
ery aspect of life. web o spe 
Situation is higher education at this time: (a) the apparently SAM 
expectations of people, as expressed in a prolonged phenomenal € 
in enrollment, (b) the lag in adequate facilities, (c) the growing s 2 a 
of college and university teachers, (d) the Paucity of accurate know "T 
and theory concerning the nature and function of higher learning, (e) ; 
revolution in growth of knowledge, (f) the increasing demands on hig her 
education, (g) the culture-wide upheaval in value Systems, geen 
among the young, (h) a constant and growing financial crisis, an 
(i) threats to freedom from the extreme right and left. These and other 


of which thought, discussion, and writing 
ould not respond to such stimulation, and higher 


pressures creates a very special 


must come; only death w 
learning is not dead, 

Such demands cou 
formulations. This i 


In order to facilitate compari- 
ollowed in general that of the 


© areas have been added: (a) government 
(b) professional education, 


sons, the organization of this issue has f 
October 1960 issue, although tw 
and education and 


> requiring the examination of 


Significant research in education js extremely rare. The problems of 
research are bafflingly complex; the methods, lifted almost bodily from the 
physical sciences and often applied with a neurotic persistence, often bring 
disappointing results, To face this f 


tempts to apply the methods of physica] science to the problems of 


to research in education. Such ap- 
, imaginative, and comprehensive 
may be more like carefully conceived 
methods to which we are now wedded. 
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Thus, I would hope that in the next five years we will encourage es- 
pecially the ablest students among us to formulate theories and seek 
imaginative ways of testing them. Mistakes will be made and research 
design may not fit the accepted pattern; but significant discoveries and 
meaningful insights may be achieved. This is not a plea for irresponsibility 
in method but for greater imagination and flexibility in both general 
theory and specific research design. In one way or another, this plea 
appears often in the chapters of this issue of the Review. 

I wish to express my keen appreciation to all who have helped with this 
issue for their skillful, prompt, and conscientious work. Each contributor 
worked diligently to make this issue scholarly and, at the same time, mean- 
ingful and interesting. It has been a personal and a professional joy to 


work with them. 


Earl V. Pullias, Chairman 
Committee on Higher Education 
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CHAPTER I 


Students 


GEORGE D. YONGE 


ight major research trends and findings 


This chapter is an attempt to highl 
Only selected studies published since 


on the student in higher education. 
June 1960 are reviewed. 
Inasmuch as psychological 


by researchers, they are not independent of the 
to answer these questions. For this reason, a clarification and under- 


standing of methodology is essential to an understanding of the facts; 
some of the studies below are discussed with this in mind. i 

There is a necessary overlap between this chapter and the next, which 
deals with environments, since the psychological and educational impli- 
cations of student characteristics cannot be adequately understood apart 
from an understanding of environmental characteristics. 


facts are replies to specific questions posed 
specific methodologies used 


Freshman Enrollment by Major Field 


tical summaries by which differential 


enrollment trends among major fields can þe estimated. According to his 
surveys (Parker, 1962, 1965), full-time freshman enrollment increased 
by approximately 42 percent from 1960-61 to 1964-65. Nursing showed 
the greatest increase in enrollment (62 percent), whereas the increased 
enrollment in engineering (13 percent) and in agriculture (24 percent) 
did not keep pace with the overall rate of increase. Freshmen declaring 
majors in arts and sciences, business, and education showed increases 
in line with the overall trend. In both 1960-61 and 1964-65, approxi- 
mately 58 percent of th ere male. Further, the 
proportion of males in ea! ories was the 
same in 1960-61 as it was in 

The figures derived from Par 


Parker has provided annual statis 


e enrolling freshmen w 
ch of the above major field categ 
1964-65. 
ker’s surveys are only approximate pro- 
jections of overall enrollment trends by college major. In 1960-61, the 
freshman enrollments were based on data furnished by 858 institutions; 


in 1964-65, 935 institutions provided enrollment data. Furthermore, 
the deçlaration of a major is a tentative commitment for most freshmen. 
Since different major fields have different holding powers (Thistlethwaite, 
1960), these data should not be used to estimate other than freshman 
enrollments in various majors. 

Two studies suggest that increasing enrollments do not necessarily 
result in a reduction in ability level. Darley (1962) obtained ability 
253 


Review or EDUCATIONAL RESEARCH Volume XXXV, No. 4 


scores for freshmen entering 167 institutions in 1952 and in 1959. 
He found that the mean ability score for 1952 freshmen was lower than 
the mean score for 1959 freshmen, even though these institutions reported 
a marked increase in enrollment during this period. Hood and Berdie 
(1964) noted that, during the past 11 years in Minnesota, the proportion 
of higher ability students planning to attend college increased almost 
three times as fast as the total increase in high school graduates planning 
to attend college. The net result has been an increase in the intellectual 
level of college students over the past decade. To the extent that Minne- 
sota data can be generalized to other states, there is no reason to assume 
that the average ability of colle 


ge students is decreasing because of in- 
creasing enrollments. 


Selection, Prediction, and Persistence 


Selection 


Heist and Webster (1960) w 
consideration of nonintellective 
selection procedures, 
values, and interests of students 


acteristic: i : å 3 
a S as aptitude, interests, values, social background. 
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and i ispositi i 

E intellectual disposition. Differences in college environments were 

ks pointed out. Further, he considered the question of fitting the 
ent to the college so that optimum student development might be 


realized. 


Prediction 


m SpE, EOS); jue a nationally representative sample of approxi- 
mad a gra uates from 147 high schools, studied the correlations 
ean pe : ay Scholastic Aptitude Test (PSAT) scores, high school 
id bris j^ ege attendance. He obtained the following point-biserial 
aii RIT — college attendance and the three independent vari- 
v a verbal, 0.53; PSAT mathematical, 0.51; and rank-in-class, 
40. From his data, Seibel also organized a table reflecting the proba- 
bility of college attendance for various levels on the PSAT and rank-in- 
class dimensions. High scores on these variables appeared to be more 
closely related to college attendance for men than for women. Seibel 
also presented PSAT norms for students attending various types of in- 
stitutions. As in Darley’s study (1962), based on college freshman 
scores, there was a progressive increase in mean ability scores of students 
attending institutions according to the highest degree offered. There 
was, however, considerable overlap among the distributions of ability 
scores classified by the various types of institutions. 

It would be extremely difficult to demonstrate any direct effect of 


the writings of Fishman (1962) on the types of prediction studies cur- 
heless, many investigators have been moving 
ing intellective criteria (e.g, grades) 
ability tests). This trend, as slight 
e greater diversity among 
bility of using different 


rently in vogue. Nevert 
away from the old model of predict 
from intellective predictors (e.g 

as it may be, can be attributed in part to th 
psychological tests, which has increased the possi 


types of prediction and criterion 

Although grades are still a popular criterion, many investigators have 
displayed an interest in nonintellective predictors or correlates of grades. 
For example, Goodstein and Heilbrun (1962) studied the contribution 
of scores on the Edwards Personal Preference Schedule to achievement 
at three levels of ability. They found that personality factors contribute 
most to the prediction of the academic achievement of the average col- 
lege male. Heist and Williams (1961) reported significant differences 
among three achievement levels in terms of a system used to classify 
Strong Vocational Interest Blank profiles for degree of intellectual dis- 
position. On the basis of this scheme, the high achievers were described 
as being more strongly oriented toward inquiry and speculative and 
creative thought than the other two achievement groups. However, within 
this homogeneous sample of bright science majors, most other differences 


were slight. 
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The study reported by Brown (1962), dealing with faculty nomina- 
tions of ideal students, represented an attempt to study student char- 
acteristics related to certain nonintellective criteria valued by faculty 
members. Although the problems inherent in the use of nominations 
are great, this study reflects a refreshing attempt to deviate from the 
usually prosaic study of grade point average correlates. . 

Holland and Nichols (1964) focused on the prediction of intellective 
and nonintellective criteria by means of intellective and nonintellective 
predictors. Their criteria of nonintellective achievements were as predict- 
able as was their intellective criterion (self-reported freshman grade 
point average). It was generally found that the best predictors of 
self-report achievements were self-report estimates of potential achieve- 
ment. The question of predictor-criterion contamination needs to be 


raised here since both measures are, in many instances, based on self- 
report data. 


Nasatir (1963) published a stud 
rate of academic failure in m 
were three dichotomies: 
academic), context ( 
by the proportion of 


Persistence 


Perhaps the most 
and college outcome 


of this investigation is that the 


ambitious study of the relationship between ability 


Darley (1962). The major limitation 
ability data are based on 1952-53 

based on the 1956 academic 
tive findings regarding the distrib 
in four states (Minnesota, Ohio, T. 
ship of ability 
within some t 
to outcome; a 9men to remain in some institu- 
256 


October 1965 STUDENTS 


tions, and men in others. However, these differences were not systemati- 
cally investigated by control for such variables as major field and ability 
level. A comparison of high ability students (top 25 percent) indicated 
that types of institutions had different holding power with respect to 
these students. For example, in Minnesota only 47 percent of the high 
ability men enrolled in state colleges graduated, whereas 73 percent 
of the men enrolled in private coeducational colleges had graduated 
by 1956. 

Another aspect of Darley’s study dealt with distribution of ability 
among a nationally representative sample of 200 institutions. These 
data were based on the scores of 1952 entering freshmen. In view of 
the diversity of the mean ability scores at these 200 institutions, Darley 
concluded that every student in the country can find an accredited col- 
lege that will admit him, regardless of his ability level. In addition, 
Darley noted that diversity in ability level among institutions was the 
pattern no matter how they were classified (e.g., liberal arts colleges). 
The mythology of higher education by which certain types of institutions 
are regarded, without qualification, as superior in the ability level of their 
students to other types was not univocally supported by his data. 

Thistlethwaite’s (1963) study, dealing with the relationships between 
persistence and personality and environmental factors, has already been 
mentioned. Matson (1963) and Willingham (1962) reported that frater- 
nity members did not drop out of college as frequently as did non- 


fraternity members. 


Astin (1964b), using a sample of 6,60 wish MN 
the relationship between certain environmental and student characteris 


and dropping out of college over a four-year period. After taroen 
for differences in student input variables, the following Bip. un p- 
logical Inventory measures were found to be owed rel oa 
dropping out for both sexes: self-control (negative), flexibility po h 
and achievement via conformance (negative). None of the col eze = 
ables (e.g., private versus public control) were related to the “on 
criterion for men. For women, the masculinity of the college (e.g, P 

cent of males enrolled) was related positively to dropping pu as v: 
the private (versus public) classification of the e A al im 
being a women’s college were negatively related to the ropou € . 
Astin also found that the dropouts, compared to ‘apache er! 
lower socioeconomic background, had lower ranks in He scho 3 = 
were lower on several questionnaire items that assessed cg s 
demic motivation. The comparison of dropouts and as 

tive of the institutions from which they withdrew is a specific limitation 


of Astin's study. Darley's (1962) data, for example. mom es a 
stitutions differ markedly in their power to hold high : s ity ee n ; 
this at least suggests that college withdrawal may reflect different psycho- 


logical phenomena at various institutions. 
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Interaction of Student and Environmental Characteristics 


The development of the College Characteristics Index (CCI) to meas- 
ure environmental press and of the corresponding Activities Index (AI) 
to measure psychological needs (Stern, 1962b) has stimulated research 
and thought concerning the interaction of stu 
characteristics. In considering some of these studies, it may be well 
to focus on certain methodological difficulties connected with the use of 
the CCI and AI as well as with certain extensions of the ideas of Stern 
and Pace proposed by Astin and Holland (1961). 

For example, it is difficult to defen 
of the CCI and AI because both sets 


dent and environmental 


the interaction is already 
hence, artificial results may be 


A Using a sample of Psychology students at 
one institution, showed that there was no correlation between CCI scores 
and the paralle] Scores on t 


not compelling, 
their environments differently 
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a variety of institutions, are needed before the interpretations of Pace 
and Stern can be accepted with a high degree of confidence. In the 
light of the published data, it is difficult to maintain a sharp distinction 
between student and environmental characteristics as measured by the 
AI and CCI. 

The Environmental Assessment Technique (EAT), first put forth by 
Astin and Holland (1961) and later used in a series of articles by Astin 
(1962; 1963a, b; 1964a, c) relating institutional, student, and EAT dimen- 
sions, illustrates a concerted effort to obtain independent measures of 
student and environmental characteristics. The EAT measures eight char- 
acteristics of college environments: size, average intelligence, and six 
personal orientations (based on the proportions of students graduated from 
a given institution in major fields) classified as realistic, intellectual, 
social, conventional, enterprising, and artistic. In a subsequent study, 
Astin (1962) factor-analyzed 33 college attributes and reported six 
principal dimensions: affluence, size, private versus public, masculinity, 
realistic emphasis, and homogeneity of the environment. In another study, 
Astin (1964c) factor-analyzed 52 student input variables and obtained 
six major dimensions: intellectualis, estheticism, status, leadership, 
pragmatism, and masculinity. In the most recent study of the series, 
Astin (1964a) compared the presumably independent estimates of stu- 
dent and institutional characteristics derived from the above studies. 
He reported that characteristics of entering freshmen classes were 
highly related to institutional characteristics. That is, the high ability 
students were exercising a high degree of self-selection in deciding where 


to attend college. 

In terms of the methodolog 
or environmental attributes are bas 
bodies (e.g., percent graduated in realistic fields) , 
acteristics are defined in terms of the characteristics of present student 
enrolled in realistic fields) . However, it is difficult to 
aracteristics are transformed because 
And if this temporal dis- 


y proposed by Astin, certain institutional 


ed on the characteristics of past student 
whereas student char- 


bodies (e.g., percent 
see the magic by which student ch 
of time into environmental characteristics. temp dis 
tinction is valid, some of the environmental characteristics in Astin's 
studies will have to be considered student characteristics with reference to 
still previous generations of students. Rather than providing independent 
estimates of student and environmental characteristics, Astin's method- 
ology may merely be a way of estimating the extent to which. the 
processes of formal selection, self-selection, or both remain relatively 


constant from one generation of students to another. 


These methodological questions render some of the interpretations of 
Nonetheless, Astin, Pace, and 


the above-mentioned studies ambiguous. \ à 
bution to the literature dealing 


Stern have provided an inestimable contri t a 
with the student in higher education. Their pioneering studies are truly 


breakthroughs; they have shifted the research emphasis from a descriptive 


to a dynamic model. 
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Self-Selection 


The phenomenon of self-selection of institutions by students, noted 
above, is corroborated by other studies taking a slightly different ap- 
proach. McConnell and Heist (1962) explicitly recognized this phe- 
nomenon in their chapter, as did Stern (1962a) implicitly in his 
chapter, in The American College (Sanford, 1962). Heist and others 
(1961) found that students (future Ph.D.’s) attending institutions of 
high and low scholastic productivity 


ague universities were more strongly 


and more independent, original, and flexible than students attending 


public universities. Furth 9 entered Roman Catholic institutions 
were generally less intellectually oriented and more authoritarian than 
men who entered Protestant insti 


itutions. These were National Merit 
irtually any institution of their choice. 
gave a very brief but instructive illustration of 
desirable students (formal selection) provide 
age of the institution, allowing student self-selection 
m es over bed te Die. approntati Tet 
S process i ; 

of bot: select ud disi tudent and a certain type 

Clark (1960, 1962), by developing the theme of institutional images 
or student self-selection, shed considerable light 
n. He considered how images might be 
and he suggested some areas of needed 


titution, self-selecti 
ership in stude 


or field, regar 


5 ding t isi tai in 
special programs such as h E the decision sf adiri: ee 


Both within and 


as in the individual. That is, the figure- 
gun Mm of the environ (its meanings) are modified by stu- 
ent characteristics such that in possible choices (meanings) are 
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Summary and Conclusions 


It was noted that the average ability level of college students is not 
necessarily decreasing even though enrollment continues to increase. In 
the area of selection, there is a growing concern for the inclusion of 
environmental and nonintellective variables in the selection process. Non- 
intellective predictors have been used to predict both intellective and 
nonintellective criteria to a greater degree than was previously true. 
Perhaps the most salient and important research trend to emerge was 
the systematic investigation of the interaction between student and en- 
vironmental characteristics. These studies focused on the correlations 
among measured student and environmental characteristics as well as the 
study of student-college self-selection. Perhaps this shift from a predom- 
inantly descriptive to a more dynamic level of analysis may be considered 
a major breakthrough in the sociopsychological study of the student in 
higher education. 

Two areas of research conspicuous by their absence were studies 
of student satisfaction with regard to educational experiences and the 
study of faculty attitudes, values, and interests and their relationships to 
student characteristics. To the extent that faculties do influence student 
development and aspirations, this neglected area of research is essential 
to an adequate understanding of student-environment interactions. — 

Studies are needed to help clarify certain methodological difficulties 
mentioned in the present chapter, and many of the studies reviewed war- 
rant replication at other institutions and with students less gifted in terms 


of measured scholastic aptitude. 
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CHAPTER II 


Campus Environment 


WILLIAM B. MICHAEL and ERNEST L. BOYER 


Since Pace and McFee prepared their chapter for the October 1960 issue 
of the REVIEW, the direction of research on the college environment has 
been toward the refinement and consolidation of past efforts. Although 
both individual workers, with substantial grants-in-aid, and agencies such 
as the Center for the Study of Higher Education, the Bureau of Applied 
Social Research of Columbia University, and the National Merit Scholar- 
ship Corporation have been conducting re: 
ful fashion, little innovation has resulted 
numerous large-scale research investigati 
and verification of the construc 
contributors in the late 1950s. 


search in a systematic and care- 
- Nevertheless, there have been 
ons that constitute a validation 
ts and theory developed by imaginative 


Summaries and Bibliographies 


Since 1960 there have been several important summaries and biblio- 
graphic contributions concerned with r 


lished during the past five year: 


d s, it 
in this chapter. 


Intellectual Climates 


Expanding considerably upon his earlier and larized 
paper (Pace, 1961) concerned with diversi nd somewhat populari 


Pace (1962) described the items and scales f th l teristics 
Index (CCI), identified four factor di Sne College Chara wis 


out similarities and differences in the 
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dimensions, and enumerated several methodological approaches to the 
study of college environments. He also related how, in terms of their 
response to items of the CC/, students at San Francisco State College held 
a perception of the intellectual climate at their campus that was quite 
different from that formulated by Riesman and Jencks (1962a) in their 
subjective and impressionistic anthropological commentary on the culture 
of this college. 

From the data furnished in a large-scale survey 
135 American colleges and universities, Davis (1963) undertook the study 
of one variable, intellectualism, in terms of the percentages of students who 
responded to certain questions about the purposes of a college or univer- 
sity. When a student at a given campus answered a question relative to 
whether it was important to him personally, his response was interpreted 
as a measure of the true value climate of intellectualism at that campus. 
When each student responded to the same question as being most im- 
portant to the typical student at the campus, his response was taken as a 
measure of the perceived climate of intellectualism. An index of quality 
for a given college was based on the average level of ability scores of the 


of 33,982 seniors at 


freshmen enrolled at that college. . . 

Briefly, the data analyzed by Davis revealed the following five fin ings: 
(a) A large proportion of students at private, small institutions of high 
quality endorsed intellectual values, whereas only a small proportion of 


students at lower quality institutions (principally large qq mee 
and technological schools) supported intellectual values. ( / : e iso 
of perceived values was positively related to the climate e ums pb 
(c) The perceptions that students held regarding an PEE ion: ber 
were distorted toward their own values. (d) peche xis delis 
relatively high grade point average in ge ue wore of the 
tively low grade point averages tended to furnish ORE is amare be 
intellectuality of their campuses. (e) Relatively marked cna nin 
tween the degree of perceived intellectuality and se. Th à findings were 
associated with the geographic location of thercollege. k "i social percep- 
meaningfully interpreted within a theoretical i aes 
Son involving riii F vent st and ve paper concerning college 
As previously mentioned, the mos who 
eny ronnen to appear since late ven ee epi key 
presented not only a brief history of prior piv th dimensions of sev- 
terms but also a description and discussio? n t risas of a sample of 
eral college environments and the personality ee 


wee ; or external. 
more than one thousand students. Casan. cu the indi- 
i unterparts $ 
P ? and the ps chological counterp 7 
on as pees an psy ress is inferred from actions 
vidual as “needs, 


ized that ; 
i ps eiie E at needs are inferred 
done to the individual in some $ 


je ifested by the individual in his striv- 

fol 1 teste! H 
from the characteristic bebe man ^t. Stern also described 
in ucture an 
g to structu that had been develope 


interpret 
briefly two instruments 
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measurement of needs and press: (a) the Activities Index (AI) consist- 
ing of 30 scales of 10 items each to which the respondent indicates his 
like or dislike of each item and (b) the previously cited CCI, which is 


liking or disliking a definite schedule for studying. The items in the CCI 
scale are concerned with policies, procedures, impressions, attitudes, and 
activities that may be characteristic of various types of undergraduate 


A much abbreviated Summary of the conclusions that Stern drew from 


à vast amount of previous research with the 47 and the CCI appears to 
be in order, Particularly for the reader who is unfamiliar with such work. 
A few of the major conclusions follow: 


scales reflecting emotional controls 


dents Preparing for work in the same fields, with th 


os wlien matched with education majors have shown lower intellectual 


een determined between profiles on 
sampled. Specifically, three rather 


majority of colleges studied have te 
characteristics) ; (b) the 


highest standing on the intellectual 


e^ 
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for AI and CCI responses, Stern identified six environmental factors 
extracted from the intercorrelations of the CCI scales as intellectual orien- 
tation, social effectiveness, play, friendliness, constraint, and dominance- 
submission. He then proceeded to report systematically how the percep- 
tions of students who were attending two groups of colleges at opposite 
(high and low) positions on an intellectual climate score differed with 
respect to each of the environmental scales on the CCI as well as to each 
of the personality scales of the AJ. Numerous comparisons were made be- 
tween 11 high-ranking and 11 low-ranking institutions in intellectual 
climate relative to (a) size and composition of student bodies, (b) re- 
gional representation, (c) type of institutional control, such as public, 
private, or denominational, (d) type of program, such as liberal arts, 
teacher preparatory, or professional, (e) highest degree offered, (f) 
student activities, (g) financial assets, and (h) tuition. Personality char- 
acteristics of students in the colleges of high and low intellectual climate 
were also compared and evaluated. : 

In the main, the conclusions reported in his summary of previous 
investigations with the 4/ and CCI were sustained. Certain specific find- 
ings may be noted. Derived from an unweighted sum of scores on 14 CCI 
scales that were grouped in four components identified as substantive 
intellectual interests, academic motivation, social effectiveness, and self- 
assurance, this intellectual climate score exhibited correlations of 0.49 
with the percentage of National Merit Scholarship finalists among enter- 
ing students, 0.59 with the number on such scholars at all class levels, 
and 0.83 with a verbal subtest score of the College Entrance Examination 
Board Scholastic Aptitude Test. Stern interpreted these gi re 
findings to indicate that colleges choose their students ene care dl 
than the students select their colleges. Most of the colleges high in intel- 
lectual climate—the so-called institutions of academic excellence—were 


à : " ivi i liberal 
found to be relatively small, residential, and esca Le was in 
" rana ] education program wa 
arts colleges in which (a) a strong genera n + 

D HE is activities and hard work; 


existence; (b) stress was placed on scholarly poten oe: 
(c) personal autonomy, nonconformity, and intellectual fre 


£ respe i ) proiessors 
S ; were highly valued and resp cted; ( 
tudents and faculty were highly x ; d f ; 


were available to talk with students inforr 4 ts were numer- 
and (e) cultural activities and intellectually-oriented me d low in 
ous and well attended. On the other hand, ae ^ Ec (a) the 
intellectual climate were largely public netos nm M ges than 
orientation was toward practical and vocational aoea dim 
toward intellectual or cultural activities s ënds 2: ET (b) ro et 
the prevailing attitude was not necessarily vi swa Re onc: 
in social relationships and in patterns of vgridtet Deed (a) a 
(c) opportunities for extracurricular pae des matters as class 
siderable organizational, bureaucratic emphasis on s 3 


attendance and departmentalization of the curricula existed; and (e) the 
culture and the academic community was 


separation between the peer 
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relatively well defined. From the standpoint of personality, students in 
colleges with high intellectual climate in comparison with those in institu- 
tions with low intellectual climate tended to (a) possess stronger intellec- 
tual interests; (b) have a greater desire to understand themselves as well 
as others; (c) understand political and social problems and be able to 
find solutions; (d) exhibit somewhat lower dependence needs; (e) show 
less self-indulgence; and (f) demonstrate more spontaneity in emotional 
expression. 

Stern implied that future use of his instruments and procedures would 
ers to specific questions of concern to 
college faculties, (b) a means for the comparison of the expectations not 
only of various student and faculty groups within a college but also of 


College and Student Characteristies 


Among several Papers prepared by workers at the National Merit 


Scholarship Corporation, four articles concerned with the characteristics 
of students and colleges were especially 


33 major qualities of coll 
(1962a) defined si i 


allocations, 


In a factor-analytic study of 52 į i 
nput variables f i had 
responses from more than 127,000 inen ay DR b ors 
(1964c) found six factors which he identi ae 


i a fied as intellectualism, estheti- 
cism, status, leadership, masculinity, ragmatism. In a second report, 
Astin (1964a) noted that the asp. ms of entering freshmen were 
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harmonious with the characteristics of the institutions they selected. More- 
over, in multiple-regression analysis each input factor with the exception 
of leadership could be predicted with a high degree of accuracy from the 
known characteristics of the institution (the median cross-validated mul- 


tiple correlation being 0.85). 
Additional References: Pace (1962) ; Sutherland and others (1962). 


New Seales and Assessment Procedures 


In their description of a method for measurement of college environ- 
ments which they called the Environmental Assessment Technique (EAT), 
Astin and Holland (1961) proposed eight characteristics of the student 
body—size, average intelligence, and six personal orientations identified 


as realistic, intellectual, social, conventional, enterprising, and artistic— 


and correlated these characteristics with the CCI scales. Having shown 


the stability of the six personal variables to be relatively high, the writers 
discussed the practical advantages of the simplicity of the EAT and 
pointed to its uses in the study of student-faculty interactions, in the 
formation of comparisons of the characteristics of different institutions, 
and in the conduct of cross-cultural investigations. Subsequently, Astin 
(1963b) furnished additional validation data for seniors at 76 colleges 


and universities. 
In a new, carefully standardized 150-item instrument known as the 
(CUES), Pace (1963), who 


College and University Environment Scales 


had demonstrated a lack of parallelism in the organization of factors in 
d for measures on five factors of the 


the AI with those in the CCI, provide ures o Net 
college and university environment which he identified as practicality, 
community, awareness, propriety, and scholarship. Relative to the 30 
scales of the AJ, each of which consists of only 10 items, the five 30-item 
scales of the CUES afford at least three important advantages: (a) a more 
parsimonious evaluation of the institutional differences in qm i 
environments, (b) greater score reliability, and (c) scores that could be 


i tive data. 
related to somewhat more representative norma 
Two expository papers, concerned with the measurement of student char 


acteristics and with the assessment of the quality of fetu o von 
learning respectively, were those of McConnell (1963) and Stuit ( , 


each of whom cited several relevant research studies. 


Student and Faculty Subcultures 


Anthropological Studies 


ve years, several contributions have appeared that 


During the past fi the study of college cultures— 


represent an anthropological approach to i 
xir student "i faculty subcultures. Concerned with the patterns of 
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values, beliefs, and prescribed modes of behavior of various student and 
faculty groups at different colleges, these anthropological studies repre- 
sent highly subjective, impressionistic, and literary essays, or social 
commentaries, that would not meet the scientific criteria of objectivity, 
control, and replicability ordinarily required in educational and psy- 
chological research. Nevertheless, these investigations, involving largely 
informal observations, may be viewed as rich sources of hypotheses and 


insights. Since most of these studies are highly speculative, they will be 
treated briefly. 


ge in Michigan. Riesman and Jencks (1962a) 
rancisco State College, to which 
: (as stated on page 265). Jencks 
and Riesman (1962) also furnished a case study es the patterns of 
University. Two other descriptive studies 
tures were those of Bushnell (1962) concerning 


Vassar College and of Hughes, Becker, and Geer (1962) regarding a 


medical school. 


reader's under 


imited formulation o 


Empirical and Theoretical Contributions 


From interviews, 


PU self-reports : i 
at a large state university, Gottlieb and Hotline [ -pepierpe 


October 1965 Campus ENVIRONMENT 


for groups of college students develop relative to changes in value orienta- 
tions and attitudes. Their rationale was basically consistent with New- 
comb’s (1962) extensive theoretical arguments regarding peer group 
formation. In postulating that the individual has a need for self- 
consistency, Gottlieb and Hodgkins suggested that subcultures develop 
when students (a) minimize the importance of dissonant goals espoused 
by the college and (b) seek support of their families and college peers 
with comparable value systems as a means of reinforcing their existing 
value orientations. If one posits a hierarchy of goals for students which, 
though similar in substance to those of the college community, are dif- 
ferent in order of affective value, it is possible for the principal charac- 
teristics of each of several subgroups of students to be described in terms 
of their order of preference for certain goals. After presenting their 
rationale, Gottlieb and Hodgkins defined a subculture as “a segment of 
the student body at a given institution holding a value orientation distinc- 
tive of that of the college community and/or other segments of the student 


body." 


Student Perceptions of Faculty and Fellow Students 


vealed informative data concerning 


the perceptions that students hold of a faculty. In his study of highly 
talented. students, Thistlethwaite (1960) requested them to evaluate on a 
checklist the three college professors who they thought had contributed 
most to their desire to learn. He noted that the students overwhelmingly 
elected instructors who had shown great enthusiasm for their disciplines. 
At least 80 percent of the participants pointed out that these teachers 
had exhibited an interest in them as well as in their work. 

For a sample of college students who were judged to come from. lower, 
middle, and upper social classes, Gottlieb (1962) found that virtua y 
all students chose as their ideal teacher one who would be iae pei 
in his presentation, who would know his field well, and who e a 
students produce. Students from a low social class who were also re atively 
low achievers preferred instructors who would help them in an qudm 
way with their career plans, whereas students from a low weh ass m 
were relatively high achievers appeared to depend on adults = teac! : : 
for considerable emotional support. There was a marked tenc ae «i 
high achievers in all social groups to stress originality, ica. i" 
dedication to their scholastic endeavors. Since at all achievement evels 
male respondents from the middle and upper social classes, in — to 
those from the lower social class, indicated that their parents served as 
sources of influence and encouragement, it would appear that college males 
from the lower social class may be relatively dependent upon a college 
instructor or adult authority figure for encouragement and reinforcement 


if they are to do superior work in college. 


Three empirical investigations re 
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Employing 180 items from the CCI divided into two groups of 90 
each—one group pertaining to college students’ perceptions of fellow 
students and the other to students’ perceptions of faculty members— 
Nunnally, Thistlethwaite, and Wolfe (1963) carried out two M 
factor analyses of these two sets of items, which had been administere 
to 500 freshman and sophomore students at the University of Illinois. 
Relative to students' perceptions of the characteristics of faculty, the six 
dimensions isolated were described as systematized energy, toughness, 
availability to students, interest of lectures, interest in arts and humanities, 
and vocational emphasis. With respect to the perceptions of the charac- 
teristics of fellow students, six factors were identified to be intellectual 
drive, personal appearance and manners, competition, science interest, 
pressure against scholarly activities, and interest in visiting speakers. 

Additional References: Richardson and Blocker (1963) ; Rubin (1963) ; 
Taylor (1962) ; Teahan (1963) ; Veldman, Peck, and McGuire (1961). 


Relation of College Environment to Productivity 
Although the principal em 
lems associated with outco 
behavior of college students i i i 


nctions, and opportunities 
nt and its subcultures” (p. 250), Brown (1962) 
research workers at the 


ship Corporation; the model related to desired 


outcomes in the college experience such factors as (a) 


peer groups and facult 
bles associated with f 
dispositions ( abilities, personality traits, and career an 
educational plans), and (d) societal i 


à on, Brown argued within a socio- 
psychological theory, as have Fishman (1 


962), Pace (1961, 1962), and 
Stern (1962a, 1963), that efforts should ae | 
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For example, Thistlethwaite (1963) presented evidence that the climate 
of undergraduate colleges that is associated with the subsequent produc- 
tion of future scientists differs substantially from that of undergraduate 
colleges noted for their preparation of scholars in the social sciences and 
humanities. Among several other publications that furnished informative 
empirical results concerning the relationship of college environments to 
productivity were those of Astin (1961; 1962b, c; 1963a, c: 1961b), Astin 
and Holland (1962), Nichols (1964), and Thistlethwaite (1960; 1962a, b). 
Several of these papers offered somewhat contradictory findings concern- 
ing the productivity of Ph.D.’s as a function of college environment in 
view of the use of both different criteria of output and various kinds of 
Statistical adjustments to correct for differences among colleges relative to 
(a) ability level of students, (b) major fields, and (c) such characteris- 
tics as intent and aspiration. Detailed consideration of the substantive con- 
tributions of these research studies concerned with productivity may be 


found in Chapter III. 


Conclusion 


It is evident from the several contributions just reviewed that many well 
designed and relatively sophisticated studies based on substantial empiri- 
cal findings have given support to various theoretical positions regarding 
(a) the sociopsychological nature of college environments, (b) student 
characteristics, (c) faculty and student subcultures, and (d) student per- 
ceptions of faculty and of other students. In light of expanding college 
enrollments and probable changes in the value systems of students, of 
faculty, and of college and university administrators during the years 
ahead, a center or bureau of institutional research might advantageously 
undertake periodic studies to assess these changes, especially if continued 
improvements are to be realized in the selection and placement of college 
students and in the evaluation of modifications of student behavior and 
attitudes relative to dynamic institutional objectives. In paruen an addi- 
tional carefully planned investigations involving use of contro greps 
are needed to ascertain whether changes noted in attitudes of students 
occur in relation to differences in college environment or, in fact, take 


place in spite of any formal college experience. 
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CHAPTER III 


Outcomes of College 


ERNEST L. BOYER and WILLIAM B. MICHAEL* 


Wh 
To ae a drogom of college? What happens to those who remain? 
oe at modus = lege outcomes persist in later life? These questions 
if the Bonam perd researched during the period covered by this issue 
Gebeoud ens l ough most of the recent research has duplicated and 
ienasi n i Aem e, exceptions can be noted. Of special conse- 
meds ies is period have been investigations which were designed to 
E dera nme adults at the same time that changes among college 
Mem cde eing assessed. In light of data from these and other studies, 
cation of earlier conclusions is indicated, and new areas of inves- 


tigation are introduced. 


Early Withdrawal from College 


i the isa d 1960 issue of the Review, the much-examined college 
of aan em has been further probed. A survey covering four decades 
Student : was completed; an investigation of attrition among graduate 
en was reported; and, most significantly, a cluster of studies based 

a new and realistic definition of the college dropout has appeared. 


Rate of Attrition 


kill (1962) reviewed 40 years of 


In The American College, Summers 
dies conducted between 1920 and 


um dropout research. Citing 35 stu 
0, Summerskill disclosed, in a decade-by-decade analysis, a 53-percent 


iu rate for the 1920's and a 50 percent rate fon the ee 

r LO s and 1950s, the rates of attrition were 49 percent and 51 percent, 

espectively. For the aggregate of these 35 studies, a median student loss 

of 50 percent was reported. 

fe Success among students at the gra 
one notable exception, Wright 


duate level remains largely unexplored. 
= (1964) traced the progress of 176 
graduate students for 11 years. Of 115 master’s candidates enrolling in 


ee 46 received no degree, 58 earned the hoped-for master's degree, and 
completed a doctorate by 1961. Of the 61 Ph.D. aspirants studied, 20 


—— 
* 
cud The authors wish to acknowledge the assistance of Paul Heist, who reviewed 
constructively evaluated a draft of this chapter. 
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successfully completed the doctorate within 11 years, 8 terminated their 
studies with a master’s, and 33 failed to earn any further degree, although 
19 of the 33 already had a master’s degree at the time of enrollment. In a 
search for determinants of graduate school success, Wright found intel- 
lectual endowment to be not significantly tied to the acquiring of a master’s 
degree. A relationship between the student’s social adjustment and his 
success or failure was reported. The candidate's age was correlated in- 
versely with the earning of a graduate degree. Because success in Wright's 
study was defined as the securing of the desired degree within an 11-year 
period from the institution of first enrollment, students who may have 


transferred to other universities or those still enrolled in a degree program 
at the same university were not identified. 


College Factors Related to Withdrawal 


Personal and precollege variables, 


a such as high school rank, perform- 
ance on aptitude tests, socioeconomi 


c background, and hometown loca- 
predictors of undergraduate success 
dropout pattern of 1,080 National 
1964) explored the extent to which 
mortality and survival of students. 
istics, including affluence, size, and 
al and social, as well as 6 dichotomous 


ng females, the institution’s 
robability. This departure of 
ale student bodies may occur, 


Redefinition of the College Dropout 


Typically, 
fails to recei 
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ve the bachelor's degree from the institution of first enroll- 
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ment within four years after matriculation. Such a definition fails to dis- 
tinguish between a student's withdrawal on a temporary basis and his 
permanent loss to higher education. 

Research completed since 1960 has suggested that to restrict the study 
of the dropout to a single institution and a four-year span may be incom- 
patible with actual student behavior. Jex and Merrill (1962) revealed, 
for example, that although only 20.2 percent of the 1948 University of 
Utah entering class graduated on schedule, 45 percent of this class did 
graduate from the University within 13 years. 

In a more comprehensive study, Eckland (1964a) traced the academic 
careers of 1,180 male University of Illinois students for 10 years. Although 
586 students, 49.7 percent of the total, received a degree following a period 
of continuous attendance, only 33 percent of the graduates actually com- 
pleted their degrees within four years. Of the 594 students (50.3 percent) 
who left college before graduation, 417 graduated from some college within 
10 years. In addition, 60 of the dropout students held senior standing when 
the study was terminated, and the investigator judged them to be potential 
graduates. In short, approximately 70 percent of the original freshman 
class received a baccalaureate degree within the 10-year period. . 

Eckland (1964b) argued that variables often used as dropout predictors 
may be of limited service if a student dropout is defined as one who per- 
manently withdraws from college. Fifteen items frequently associated with 
undergraduate success or failure were classified as either intellectual or 
nonintellectual. An attempt was made to determine the power of each 
item to predict each of four levels of achievement: low achievers (fresh- 
men who failed), early dropouts (students who left college during the first 
year), all dropouts (all students who left higher education and did not 
immediately re-enroll), and permanent dropouts. Intellective factors vs 
as high school rank, aptitude tests, and college grades did — kip y 
with college persistence, but the predictiveness proved unstable. Higl sch o 
rank, for example, was twice as effective in predicting a student sin ia 
dropout as it was in predicting the permanency of such departure. Among 
nonintellective factors, some forces generally presumed to bear upon 


college performance (i.e. parents’ income, religion, a po 
actually reversed their predictive direction; e.g» ee 3 iim] 
dropout from college was directly related to a suburban backg , 


staying out of college permanently was inversely related i 
origins. Eckland concluded that predictive items often Ha i» h 
college attrition may fail to predict the return and graduation : ie drop- 
out. In a few instances, an item's power to predict ultimate m = 
annulled completely when allowance was made for dropouts who E qot 
Thus, recent studies indicate that it is incorrect to assume that eight 
consecutive semesters are the normal progression to graduation; the Amer- 
ican college student may complete undergraduate study at tem than: one 
institution over a longer period of time. This shift in atten ance patterns 
has implications for the theory and design of future research in the field. 
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Changes in Knowledge and Cognitive Skills 


Although acquiring information and developing cognitive skills are 
highly regarded aims of education, colleges rarely determine directly and 
empirically the extent to which these objectives are achieved. Course 
grades are the accepted index of academic progress, and passing marks at 
the end of each term are judged to be a priori evidence that the amount of 
knowledge has been expanded and the skills of cognition sharpened. 


Grades 


erage. 
Holland (1961) and Locke (1963) challenged directly the use of grades 


ic Progress, Holland developed an achieve- 
riginal papers Published, prizes won, 

y the student and found that these 
related to grades, In parallel findings, Locke 
pt | "initiated activities per- 
1€ not correlate with academic performance 


ituation. Both Writers 
call 3 
ment of more adequate student achievement criteria Gia iiai 


at Buffalo, Chapman (1961) noted no si E aie qe ei m n 
average between an accelerated gr 
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examination, and a control group, who completed a regular sequence of 
courses at a conventional pace. 

Ramsey (1962) found that differences in academic behavior among 
Harvard Law School freshmen drawn from five types of undergraduate 
colleges were a correlate primarily of undergraduate grade point average. 
Differences in academic performance also were related to father's occu- 
pation, family income, type of secondary school attended, religious affilia- 
tion, and region of residence. Although there were moderate differences 
in average undergraduate college grades for groups of students of the same 
high ability level who belonged to identifiable subcultures, these group 
differences in grade point average tended to disappear in law school— 
a leveling outcome which Ramsey hypothesized to reflect different student 
perceptions of what the school environment required. He concluded that 
individuals with contrasting cultural orientations perceived academic roles 
in different ways and that these perceptions shifted during the student's 


academic career. 


Critical Thinking Ability 


Since 1960, changes in critical thinking ability among college students 
have been investigated. Two comprehensive reports prepared for the U.S. 
Office of Education (Lehmann and Dressel, 1962; Lehmann and Dressel, 
1963) as well as a published condensation of this research by rium 
(1963) disclosed a significant shift in the critical thinking abi ii a i A 
Michigan State University students. A test consisting of 52 verba pro i. 
lems designed to measure five critical thinking factors was administere to 
one class at matriculation and again during the senior year. Significant 
differences in test-retest mean scores indicated improvement In the pee 
to define a problem, select pertinent information, recognize pss Nn 
unstated assumptions, draw valid conclusions, and formulate and s 


relevant hypotheses. 


Aptitude 


Although the intellectual capacity of students is aee ee 
stable during the college years, Webster, Freedman, "e Pies sd 
argued, both from theory and from empirical data, that for de sae n 
capacity to learn is in a highly undeveloped state and that shi 


ili i ig! ifted students. Nichols 
ability can be expected, especially among highly E 7 digo cdi 


(1964) reported on the test-retest performance o € Racord Bennie 
on the verbal and quantitative sections af i ee ie res of students 
tions (GRE) Aptitude Test. Increases in the aptitude sco d perform 
from the freshman to the senior year were noted. The "i E I i h 
ance was correlated with the type of college attended an EE y jas 
the student's major. Students majoring in economics, mathematics, an 
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chemistry showed increases in their quantitative scores relative to their 
standing on the verbal scale, whereas students majoring in humanities 
tended to increase their verbal scores relative to their quantitative scores. 
Nichols suggested that shifts in ability, insofar as such are indicated by 
mean score responses on the GRE Aptitude Test, reflected the impact of 
the study program into which the student had been channeled. 


Changes in Personality 


Empirical data recently summarized in two research surveys clearly 
indicate that changes in personality occur among college students. With 


changes, since changes among noncolle 
In The American C ollege, Webster, 


lege data to support generalizations 
development. After four years at 
more mature but less stable, (b) less 


d more tolerant, and (d) more liberal 
4 Freedman, and Heist (1962) 


regarding college impact on student 
Vassar, students tended to be (a) 


ski, 1961), a private liberal arts college ( Parkes, 1962), six California 
junior colleges (Telford and Plant, 1963), a state college (Plant, 1962), 


and a large public university (Lehmann, 1963; Lehmann and Dressel, 
1962). This same instrument was readministered t 


later, and in all cases the retest means earned in 
were significantly lower than wer, 
females became more open-minde 
two or more years of college. 

A reduction in ethnocentrism over 


four l ted b 
Lehmann and Dressel (1962) and by om ipu ne 


Lehmann (1963). Scores on an 
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State University. The move was toward more flexibility and less rigidity. 
On the other hand, a study by Foster, Stanek, and Krassowski (1961) did 
not reveal a correlation between a decrease in ethnocentrism and duration 
of college attendance. As Lehmann and Dressel (1963) proposed, however, 
this latter finding may reflect the somewhat homogeneous religious posture 
of the sample tested. 

An intensive empirical and descriptive study at Sarah Lawrence College 
(Murphy and Raushenbush, 1960) traced the progress of 46 girls from 
the freshman through the senior year. Data obtained from faculty inter- 
views, as well as from various psychological and ability tests, led the 
investigators to conclude that during the four Sarah Lawrence years 39 
of the 46 students had developed a strong sense of social responsibility, 
gained confidence in personal relationships, and developed clearer self. 
roles. 

In his attempt to evaluate the impact of a specific curriculum on non- 
intellectual changes among college students, Plant (1964) administered 
measures of dogmatism, ethnocentrism, and authoritarianism to freshmen 
who had been invited to participate in a special humanities program at San 
Jose State College. After two years, mean changes on the three scales were 
reported, both for those who had completed the program and for students 
who had elected to enter the college’s general education program. Although 
both groups changed significantly toward decreased dogmatism, ethnocen- 
trism, and authoritarianism, the humanities group had the larger net gain. 
Reservations must be made regarding both the process of selecting partici- 
pants and the introduction of a possible Hawthorne effect associated with 
the uncontrolled enthusiasm in the experimental group. : 

Longitudinal studies under way at the Center for the Study of Higher 
Education at Berkeley concern attitudes toward civil liberties and "reete 
freedom held by students at three liberal arts colleges. Although McConne 
(1962) reported significant attitude differences from campus a campus, 
he concluded that, in general, freshmen are less permissive b ea sic 
and that the change from the freshman to the senior year 18 We B 2 
tolerant position regarding matters of civil liberties and academic freedom. 


Control Groups 


The most significant development during the review period um aa 
the inclusion of control groups in studies designed to "po Lyn 
among college students. In the December 1964 issue 0 ee poet 
McCullers and Plant reported on two investigations (Plant, i s : 
and Plant, 1963) in which both a college and a suse popu m o 
young adults were studied to determine whether results a a big 
attributable to college experience or to noncollege d n the : — 
these studies, Plant not only tested after four years - A who ad 
enrolled and persisted at San Jose State College but also readministere 
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three personality scales to subjects who had attended college for more 
limited periods or who had not attended. He noted that not only students 
who had attended college one or more semesters but also young adults who 
had not attended college became less dogmatic, less ethnocentric, and less 
authoritarian during the four-year period. 

Telford and Plant (1963) administered the Study of Values, the Dogma- 
tism Scale, and five scales of the California Psychological Inventory (CPI) 
to all students applying for admission in 1960 to six junior colleges. A 
sample of 1,713 students, retested after two years, were grouped by sex 
and by level of college attendance. Young people not attending a junior 
college were grouped together, as were those who had enrolled for one 
year or less and those who had attended three or four semesters. During 
the two years there was a reduction in dogmatism for all students, irre- 
spective of the period of college attendance. Similarly, neither college 
enrollment nor length of attendance was related to test-retest changes on 
the five CPI scales. The results for the Study of Values revealed greater 
variability than the results for CP]. Because of its structure, however, the 


onal attainment groups changed to a signifi- 


cant degree and in the same Irection, irrespec ive ratio: 
d th d " pect: of the du ation of college 


To ven eA been enrolled for more 
ars, 
University for more than and (c) students who had attended the 


two but fewer than three yea i 
e ^ 
mental group and the three control years. When the experi 


3 groups were co i 

major trends emerged: (a) For both meles Peu ies 
, 

» regardless of the amount 


experimental and control 
to new ideas over the four-year period, but length of college attendance was 
not related to the extent of change. (c) Students of both sex ho had 
attended college continuously for four lestie 


ded : Years became more flexible and 
relative in their system of values, Whereas those attending college for three 
years or less became more traditional in their value orientations 


284 


October 1965 OUTCOMES oF COLLEGE 


Student Values in Relation to College Behavior 


Following another theoretical formulation, Bidwell and others (1963) 
argued that the value structure which the student brings to college deter- 
mines to a greater degree than any other factor the nature of his under- 
graduate career. After typing students as either traditional or emergent, 
the investigators studied three sets of variables: the student's value com- 
mitments, his system of personal goals, and his behavior in the college 
setting. The student's values were reported to be predictive of such under- 
graduate behavior as choice of major, peer relationships, extracurricular 
activity, and future career plans. The researchers concluded that personal 
value commitments move the student with ever-increasing selectivity toward 
predicted behavior in both the academic and extracurricular spheres. 


Evaluation of Previous Conclusions 


The widely discussed Jacob survey, reviewed at length by Bloom and 
Webster in the October 1960 issue of the Review (pp. 326-27), suggested 
that although sociological forces are at work on the college campus, basic 
values of students remain largely unaffected. The appearance of numerous 
subsequent studies in which changes among the college population were 
identified with respect to both attitudes and values as well as with respect 
to more generalized dispositions of personality led Sanford (1962, p. 806) 
to assume a more hopeful view concerning the impact of college experience 
than that adopted by Jacob. Research findings now indicate, however, that 
changes among students heretofore attributed to college factors may in fact 
be developmental changes that occur among brighter than average young 
adults irrespective of college attendance. So viewed, the college ji einn 
may act simply as a catalyst to speed changes that would occur or e y 
as the individual matures (Plant, 1962; Plant and Minium, 1964). B yet, 
neither those personality outcomes classified as uniquely collegiata aS 
those factors of the college context that may significantly ms ros 
and value changes of the student have been precisely located (Le 


and Payne, 1963). 


Changes in Educational and Vocational Goals 


j ise plans for further 
tly change majors and revise plan 
ion tn dde m b for shifts in aspiration were not 


schooling, but the forces responsible É ies indi 

accurately identified during the period under review. a I 
cated that college environmental factors determine ame raTa Tiedt 
whereas others suggested that the so-called productivity of a college reflects 
primarily the characteristics of the students it admits. 
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Academic Major 


Pierson's (1962) survey of bachelor's degree candidates at Werder 
State University revealed that 30 percent of the class of 1958 graduate 
in majors other than the ones selected upon entering the University. a 
most frequently named reasons for changed majors were (a) lack of knowl- 


edge about the extent of curricular opportunities in the University, (b) 
content of courses in the original major, and (c) 


€ subculture with which they are identified does 
aflect career plans, 


d not be predicted from aptitude 
test scores, whereas the j i 
during the freshman 


sciences, whereas 
the social sciences 


Educational Aspiration 


Thistlethwaite (1960) 


compared the academi ; ] of 
aspiration of 1,500 coll 'educue major and leve 


ege juniors with th 


goals for further education while they were in college. Students had freely 
shifted to new fields of study, with th 


declared majors than the biological sciences, However, both these fields 
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attracted new students less frequently than did the humanities. Student 
testimonials and ratings of faculty press by means of Stern’s College 
Characteristics Index (CCI) were used to support Thistlethwaite's con- 
clusion that college professors significantly affect the vocational choices of 
students. 

A subsequent study of National Merit Scholarship competitors by 
Thistlethwaite (1962) again linked the faculty to shifts in the graduate 
training plans of students. After two years of college, 33 percent of the 
subjects increased their levels of aspiration, whereas the educational goals 
of 9 percent were lowered. The majority (58 percent) did not change 
their aspiration levels. After employing an analysis-of-covariance pro- 
cedure to control for noninstitutional variables, Thistlethwaite concluded 
that the academic goals of the student were influenced by (a) supportive 
faculty members, (b) the student’s participation in an honors program, 
and (c) members of the faculty who pressed for independence, creative 
activity, and initiative among students. 

Whereas Thistlethwaite (1962b), in a third study, sought to extend and 
to confirm his findings, Astin (1962) challenged both the methodology 
and the statistical procedure employed in this research. Especially ob- 
jectionable was the technique of asking students to recall, after three years, 
their precollege educational goals. In his own attempt to explore the 
impact of college on the student’s educational plans, Astin (1963a) 
launched a four-year panel study of 6,544 high aptitude students attending 
82 institutions. Twelve separate college variables were tested, and an 
input-output design was employed to control for 21 separate noninstitu- 
tional factors. Astin reported that aspiration for the Ph.D. degree appeared 
to be negatively affected by (a) student body size, (b) percentage of males 
in the student body, and (c) the conventional orientation in the college 
environment. A complex relationship also was noted between the social 
orientation of a college and the extent to which students raised their levels 
of aspiration in terms of reporting an intention to earn the Ph.D. degree. 

In a companion study, Astin (1963b) investigated the relationship 
between characteristics of the college and the decision of 1,548 students to 
pursue a career in science. Six factors from the Environmental Assessment 
Technique (described in Chapter II of this issue of the Review) plus four 
other general characteristics—aflluence, size, homogeneity, and — 
linity—were used to assess the college. Astin reported that, for male 
students, men’s colleges in the Northeast influenced negatively the science- 
bound student, whereas technical institutes and public universities exerted 
a positive impact. Among female students the affluence of the college 
appeared to be negatively correlated with the selection of a science career. 
Since, in the main, the degree of correlation between college characteristics 
and career choice was meager, Astin concluded that precollege studént 
characteristics appeared to be much more important as career determinants 
ollege attended. Sustaining this con- 
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clusion in a subsequent study, Astin and Nichols (1964) found that the 
types of rewards and satisfactions sought by the student were the strong 
determinants of vocational choice. 


Persistence of Change 


The extent to which developmental changes during the college years are 
maintained after graduation has not been extensively examined, although 
Freedman’s (1962) chapter in The American College should serve for 
some time as a reference for past research. In his review of the study of 
young Vassar alumnae, Freedman concluded that developmental changes 
during the college years do persist among graduates. No large-scale shifts 
in ideology or attitudes among alumnae were reported. Three to four years 
after graduation the only significant personality changes were in the direc- 
tion of greater emotional stability, less anxiety and depression, and general 


psychological well-being. Since personality stabilization among Vassar stu- 


dents actually began after two years of college, Freedman observed that 


Vassar juniors and seniors, viewed developmentally, were closer to young 
alumnae than to freshmen and sophomores. 

Two other empirical studies examined the stability of, or change in, 
attitudes of older graduates. Using the California F and E scales, Freedman 


surveyed Vassar graduates for various decades, going back as far as 1904. 
Substantial differences amon 


class of 1904. measured 
the class of 1914. Sinc 
parent, however, Freed: 
noted at the time of te ere not due to age but reflected differences 
ion that had persisted over the years. 

up, Rettig and Pasamanick (1960) reached 
i - Responses from the 1932, 1942, and 1952 
classes of Ohio State University alumni to a 50-item moral-judgment scale 


moral issues varies significa 
severe the judgments, 

Since both the Freedman and the Rettig and Pasamanick studies em- 
ployed a cross-section design to stud 
the century, the attitudes of 
are not known. Thus one can 
the earlier value systems per: 
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Conclusion 


Research during this review period has served to modify earlier notions 
regarding the impact of college life and to identify major issues yet to be 
explored. For example, a clear and useful distinction has been drawn 
between students who temporarily leave college and those who permanently 
withdraw. In light of this, prior estimates of the number of students who 
drop out of college must be revised. Further, selected changes in student 
personality heretofore attributed to college attendance now appear to be 
dimensions of maturation that occur among both college and noncollege 
young adults. 

In general, this period of research has been more successful in chal- 
lenging old assumptions regarding the outcomes of college than it has 
been in establishing new generalizations. This achievement is useful but 
not sufficient. College attendance is culturally valued, and a sustained 
expansion of higher education is predicted. It seems reasonable to urge, 
therefore, that the nation's enthusiasm for higher learning be equaled by 
à commitment to study empirically the educative process. This research 
effort should be directed toward the identification and unification of the 
ways in which colleges and universities influence students and the forces 
on campus that assist or hinder institutions of higher learning in the 
fulfillment of their ambitious goals. 
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CHAPTER IV 


Educational Programs 


RALPH R. FIELDS, JOHN W. MASTIN, and JAMES P. WALSH 


Because of the Coverage in other chapters, the present review is confined 
to certain aspects of the educational program: general and liberal educa- 
tion, community college programs, flexibility in undergraduate programs, 
and curriculum development. 
During the period under review a number of general works have ap- 
peared that are helpful to the student of curriculum in higher education. 
The Encyclopedia of Educational Research, edited by Harris (1960), a8 a 
i > and contains helpful discussions and bibliographies 

programs in higher education. 


The American College, edited by Sanford (1962), includes treatments of 


many problems in higher education. The H, 
ing, edited by Gage (1963) 
of research on teachin 
Smith (1961) have brought together a val 
materials in the two 
mentary History, 


General and Liberal Education 


The literature on general and liberal ed 
apology, criticism, polemic, and a continued search for values. The pres- 
sures of numbers, the changing goals of students, and the necessity for 

management have led critics to suggest that curriculum 


if general education concepts and the 


ge are to survive, Implicit in the literature is the suggestion 
that research in Beneral education must se 


the assumptions on which curriculum devel 


without reason, then, that studies, though meager, 
deserving of scrutiny, 


ucation is characterized by 


draw conclusions Írom poor examples, Mayhew (1960), in the volume 
General Education: An A 


widely accepted features 
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colleges and universities diverse in size, sources of support, character of 
student body, scope of educational activities, and age and tradition of the 
institution. 

Sanford (1962), as part of a study sponsored by the Society for the 
Psychological Study of Social Issues, edited a substantial volume in which 
the findings and conceptions of psychology and the newer social sciences 
are brought to bear upon the processes of education in college. Critical 
of the tendency of colleges to be governed by tradition and dogma, the 
contributors examined systematically the liberal arts college as a social 
institution and its effect on the personality of the student. In the same 
volume, Katz and Sanford (1962) viewed the liberal arts curriculum as 
too often based on assumptions or as a body of knowledge identified with 
the curriculum of the graduate school. 

The role of general education drew varied interpretations. Cowley 
(1960) cited the belief in some quarters that general education properly 
belongs in secondary schools. He cited the disinterest of a substantial 
portion of faculty members, the dominant status of the research function, 
and the present major system as tending to emphasize special rather than 
general education. DeVane (1964), noting the trends toward early and 
narrow specialization as more students press toward graduate and profes- 
sional schools, expressed concern that the widespread advanced placement 
movement may sacrifice the cohesive effect of the common curriculum in 
the early years of college. However, to the extent that secondary schools 
relieve colleges of basic studies such as English composition, calculus, and 
foreign language, DeVane stated, advanced placement had merit. Sonne 
ing that honors programs and advanced placement reflect sound educationa 
standards, Haywood (1963) concluded that the average or above pec 
student could expect less in other courses. Siphoning off the best faculty 
members to teach the best students reduced the quality of the rest of € 
program by presenting unequal opportunity to unequal talents. e» rhe 
logical grounds, Vallance (1963) suggested reversing pP idi c ve Brel 
liberal arts program to permit concentration in a selected : im cuba. 
two years and liberalization in the junior and senior years, when inte 


powers are more mature. g : E 

In a study restricted to cataloged documentation c en ee 
leges, Rudy (1960) found a trend in which general veia ur — ie 
in varying degrees to the first two years, followed by specie a anized on 
professional training. Reflecting this emphasis, faculties - o TEA 
the basis of specialized department areas of scholarly an ra i 
interest. In only a few nationally renowned Herel t eg iS Pic olis 
Catholic colleges was the trend less sweeping. St. John's r im lee ie i 
Maryland, represents a case having little influence on the m 
development. i i f 14 independent liberal 

McGrath (1961, 1963a) Lees E eiae win He bend that subject 


arts colleges and the related cos É 
matter aeng and course proliferation produced not only a meager 
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body of undergraduate common instruction but also many courses of a 
postgraduate character with few students and an overworked and underpaid 
faculty. By managing the curriculum more economically, reducing course 
offerings, and increasing enrollments, the colleges studied could raise 
salaries and reduce teaching loads. The view that faculties had the major 
responsibility for revising curriculum and reducing course offerings to 
defensible bounds was shared by Thomas (1962). McGrath (1963b) 
further viewed advanced placement as promoting earl 
the extent that some colleges unwittingly had becom 
professional and graduate schools; limiting students to 
was seen as changing the traditional constituencies of | 
He saw the future determined by a continuing concern 
percentage of American youth beyond the top 10 perc 


y specialization to 
e prep schools for 
upper ability levels 
iberal arts colleges. 
for a considerable 
ent. 


Community College Programs 


€ of steadily expanding enrollments 
the proportion of new students has 


a. DeVane (1964) con- 
that he was unwilling 
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trends d i 
keen mde ome d problem of what vocational programs 
denen dba iton (1960) traced the origins and the historical 
a a ghi A pior college concept, examined its philosophical 
rna e e ow this type of college represents a response to 
Bia Pte pone society. 
eer "E through a careful study of the history of the American 
PU EA Junior Colleges, brought into focus many of the develop- 
= eg gs rea problems of the community and junior colleges. 
brn fy i ege movement could profit by many more historical 
Tae roii dae further into its second half-century of existence. 
abun ae carried out at the Center for the Study of Higher 
rome eph a e University of California (Berkeley). In a study of the 
I 4 peg public higher education, McConnell (1962) pre- 
Lx» ES ysis of public junior colleges which is particularly valuable 
on of its perspective. Through an intensive case analysis, Clark 
MC F shed a great deal of light on the problems which confront an 
EE that admits, and tries to provide programs for, all who wish to 
nter its doors. It is to be hoped that other studies will build solidly upon 


these two. 


Programs 


Unfortunately, relatively little research. dealing specifically with the 
program of instruction has been reported during this period. À number 
of doctoral studies, reported in the Junior College Journal (1964) as 
completed or under way, dealt with such curricular aspects as the factors 
bearing on performance of low ability students, criteria for selective 
admissions policies, cost differentials in instruction, and staff morale. Pro- 
physical education, and other fields were also 

as were programed instruction and evening 
63) reported the experiences and accomplish- 
ege in its television offering, including 


grams in music, nursing, 
subjected to investigation, 
adult programs. Erickson (19 
ments of the Chicago City Junior Coll 
some assessment of student characteristics, achievement, and persistence. 
In a national study of public and private junior colleges’ curricular prac- 
tices for low ability students, Schenz (1964) found that a much greater 
proportion of public community colleges than of private junior colleges 
accepted low ability students as their responsibility and made special cur- 
ricular provisions for them. 

Several other problems related to curriculum development were studied. 


Gross and Maynard (1962) examined the gaps and duplications as well as 
the evidence of articulation between the high schools and junior colleges 
in the area of social science. Harris (1964) dealt with problems of technical 
ed the justification and some of the problems 
in the junior college. Rushing (1963) dealt 


e of advisory committees. 


education. Nall (1962) examin 
of terminal general education 
with the establishment and us 
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The accomplishments of junior college graduates are one of the impor- 
tant indications of the instructional quality of the institutions. Knoell and 
Medsker (1964) undertook a fundamental study of the problem of transfer 
between two-year and four-year institutions. Among their findings were 
that the characteristics of the junior college graduates were indeed varied; 
that the type of institution to which the junior college students transferred 
appeared to influence quite strongly the grades that were achieved; that 
the first semester in the new institution seemed to involve adjustment 
problems and a slump in grades, although steady improvement thereafter 
was normal; and that the graduation and persistence pattern was quite 


good, with 45 percent graduating in the allotted time and 31 percent still 
in attendance. 


eam teaching, independent study, and 
college programs. (e) 


found that California junior 
computer education. Light 
the need for health workers 


296 


October 1965 EDUCATIONAL PROGRAMS 


One of the serious lacks in the development of semiprofessional and 
technical education is careful and considered development of experimental 
programs and systematic assessment of the results of such programs. 
Nothing comparable to the pioneer investigation by Montag (1959) was 
carried out during the period under review, but it is possible that the 
report by Harris (1964) could constitute the foundation for such under- 
takings in the future. Harris’ analysis clearly indicates the magnitude of 
the task that should be undertaken by the community college. 


Flexibility in the Undergraduate Curriculum 


Cole and Lewis (1962) visited 13 institutions in 1958 and found that 
(a) advanced placement and advanced standing were promising methods of 
providing flexibility; (b) credit by examination was receiving increasing 
attention; (c) early admission was losing favor; (d) overloading was 
generally discouraged; and (e) honors programs were generally approved 
and productive. Radcliffe (1961) reviewed advanced standing and con- 
cluded that it has merit for motivating superior students and individualiz- 
ing programs, although there is a marked lack of evaluative data. Hatch 
and Bennet (1960) reviewed the literature through 1958 on independent 
study and concluded that (a) such study is as effective as conventional 
methods but has not realized its potential, (b) there is a trend toward 
starting independent study in the freshman year and offering it to average 
as well as exceptional students, and (c) it is not a substitute for classroom 
learning. They also presented nine hypotheses for research. 

Baskin (1960) examined a number of independent study programs and 
suggested that their real value lies in improved learning and individualized 
curricula rather than in economy of teaching resources as has frequently 
been assumed. In an experiment at Antioch College, Baskin (1962) tested 
the hypotheses that students in independent study courses would learn as 
much and be as satisfied as students in regular lecture-discussion courses. 
He concluded that (a) there were no significant differences in leaming E 
retention after two years; (b) independent study need not be € o 
superior students; (c) achievement may depend on personality factors 


as much as on academic ability; (d) students were initially y ini 
with independent study but became increasingly satisfied as they became 
uctor time 


accustomed to the technique; and (e) no real saving of instr r 
was evident even though Toom time was significantly reduced "i 2 
experimental courses. Baskin indicated that the greatest bene: s o e e- 
pendent study may be increased reliance of the student on sinn or 
his education and the use of his resources. Dressel (1965) pointed out 
that the ability to study independently is a primary goal of higher 


education. — 
Chickering (1964) found personality factors to be more significant 
than academic ability in predicting success in independent study at God- 
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dard College. Dearing (1965) reinforced the findings of Baskin and 
Chickering. Grollmes (1964) polled 850 colleges and universities in 1962. 
Of the 622 that responded, 356 had special provisions for superior students 
and 94 had four-year programs. Felder (1964) surveyed 520 four-year 
institutions and found that over 300 employed independent study programs. 
Goldberg and Kurland (1965) reviewed and summarized programs for 
abler students. 

Although flexibility measures are receiving widespread acceptance in 
four-year institutions and are beginning to receive attention in two-year 
colleges, they are aimed primarily at exceptional students and are not, 
as yet, based on thorough research. 


Curriculum Development in Higher Education 


Trends and Examples 


Criticisms of Programs of general and liberal education have suggested 
substantial revisions In curri 
curricular factors, testing, teaching aids, 


and integrated courses, Mayhew (1960) view. 


Development. 


Much future interest. will be focused i 
presented by Hayward (1963, 1964), e key haria E pen 
is a three-level (not freshman, sophomore, junior, senior), nine-term, four- 
p calendar providing on-campus, field, and vacation erm experience: 
conomical operation is achieved by Year-round operation with 50 percent 
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more students but with only two thirds of the student body on campus in 
a given term. 

Undergraduate curricula have had an increased relationship to the 
community college. Dressel (1963) noted a trend to eliminate from the 
four-year college occupational and skills courses of dubious academic 
respectability and to relegate them to the community college. 


Searcity of Curriculum Research 


Curriculum study has not been accompanied by substantial research. 
Fahey and Ball (1960) evaluated the program in general education 
initiated in the Liberal Arts College of the University of Pittsburgh in 
1955. Experimental students taking core courses in English, humanities, 
social science, and natural science did as well as the control group, and 
more of them graduated. There was no convincing support that they per- 
formed as well on subject matter and critical thinking tests as the control 
group. Although the subjects performed better on subtests in social science 
and humanities, they did not do as well in natural science. Brinker (1960) 
studied courses taken by liberal arts students in four liberal arts colleges 
of the Southwest to validate the presumption that a well educated liberal 
arts graduate should have at least an acquaintance with humanities, social 
science, and natural science. The study found that humanities and social 
science majors failed to obtain an elementary acquaintance with disciplines 
in other fields and that graduates of three of the colleges concentrated in 
their majors, sacrificing breadth for depth. There was some evidence that 
advisers were motivated to enhance the prestige of their own departments 


rather than to see merit in other disciplines. 


Curriculum Change: Factors and Process 

A few studies dealt with the factors in curriculum change in institutions 
of higher education, or with the process of changing the curriculum E 
any level. This is an area of investigation badly in need of serious research; 
any attacks on the problem of change are indeed welcome. E 

Goodlad (1964), as a result of a careful analysis of the pine B 
reform movement," formulated recommendations that would A to is 
the separate discipline approach to curriculum building f a s: eee 
with the total program. Since a somewhat similar problem 3 i i p ja 
exists at the college level, his analysis is worthy of consideration by 
students of higher education. . . = 

Katz and Sanford (1962) maintained that in spite of its importance 
has been made the object of systematic investiga- 


the "curriculum rarely a À estiga 
r continuing ex perimentation 1s 
was that g 


tion." The major point expresse 
a great need. 
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Miles’ (1964) analysis of the nature of educational innovation and the 
accounts brought together in the volume entitled Innovation in Education 
are suggestive of the complexity of change as well as of possible approaches 
to implementing it. 

Johnson (1964), by his accounts of innovations in community colleges, 
underlined the many and varied shapes that changes in instruction take. 
It is apparent also that changes in teaching result many times in program 
changes. The increasing introduction of new media constitutes mounting 
pressure for program change. The analysis by Brown and Thornton (1963) 


new media in college instruction, 


College curricula respond to the climate of public opinion, and the 
statement of the Educational Policies Commission (National Education 
Association and American Association of School Administrators, 1964) 


(1960) of great interest. 
Finances have always affected the nature of 


» that educational con 


Based on the careful analysis of data from studies of two-semester instruc- 


-week trimester arrangement lead- 

ars. They reported several advan- 
r: better s lents 

and faculty; maximu i i paced uer Torre 


will suffice, 
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CHAPTER V 


Teachers and Teaching 


RUTH E. ECKERT and DANIEL C. NEALE 


Although swiftly mounting college enrollments have alerted even the lay 
citizen to the great shorta 


for instructional adjustments 2 
restricted in Scope and, in many instances, in imagination and scientific 
rigor. Of the several hundred articles and brochures published since 1960 
on staffing and teaching problems, most describe current or recommended 
practices or urge a greater concern for quality —but they provide few 
principles to guide this quest. 

Contributions of the period include significant ide 
and experimental design, 
and occasional investigations which have set n 
research. Hopefully, all these should stimulate 


Some references are included here because of 
gest trends or illustr. 


as regarding theory 
ge in a few types of studies, 
ew and fruitful patterns for 
better studies in the future. 
research merit; others sug- 
ted literature. 


The College Teacher 


Spurred by a 
in the late 1960's, a numbe. 


ducation (1961) symposium on 
d articles by Adelson (1962), Katz (1962), 


» based on questionnaire and 
mer teachers in selected fields 


October 1965 TEACHERS AND TEACHING 


"occupational image," and "institutional potency"—involved in recent 
studies of faculty recruitment. 

Interest in college teaching as a possible career appears to have increased 
substantially among undergraduate students during the past few years. 
O'Dowd and Beardslee (1961) found that students in five Northeastern 
colleges had a well defined and attractive image of the college professor 
and compared him favorably with all other major professional groups, a 
finding also stressed by Corcoran (1961) and Kinnane (1962), both of 
whom reported serious consideration of this field as a career. 


Preparation 


Although the controversy continues regarding appropriate : graduate 
training for college teachers and some new programs have been instituted, 
evaluative studies are still lacking. Berelson (1960) concluded, from the 
extensive factual data and opinions gathered in his study of graduate 
education, that the current research-oriented Ph.D. program can properly 
continue to serve both the prospective teacher and investigator. In contrast, 
McGrath (1961) supported his plea for special programs for college teach- 
ers with information gathered from several hundred college posse 
concerning shortages or lacks in graduate training. Recent efforts to stu Y 
this situation by field, illustrated by Hartung (1961), who investigate 
biologists, and Perkins and Snell (1962), who examined the opinions ana 
professional achievements of historians, should throw more light a is 
problem. A survey by Ewing and Stickler (1964) revealed that 91 lea ang 
universities—more than triple the number involved two decades aa p 
currently offering courses on college teaching or other phases of mg 


education. 


Orientation and In-Service Training 


using questionnaires, supplemented with 


In a carefully designed study Y fonnd that practically all 


interviews in selected institutions, Tracy (1961 . i 
accredited North Central colleges provided some faculty ES a 
of these programs had been initiated recently, soie a i D ndn 
well developed. McCall (1961) and Gustad (1963), who aso aY ers in 
orientation practices, confirmed Tracy's conclusion that new 


i i i . Exploring role conflicts and 
particular need help on instructional probleme. m d ees Hiit 


congruences through successive interviews wi nev Ne 
single institution, Allen and Sutherland ( 1963) boe" agp ed 
tutional responsibilities for staff orientation. Only m ce 
(1963) surveyed longer-range faculty development procedures, y 


of small Southern colleges. 
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Status and Services 


Reports issued biennially by the National Education Association, Re- 
search Division (1963, 1964) and the U.S. Office of Education (Dunham 
and Wright, 1964) on the size, composition, and academic and economic 
status of the profession have improved steadily in quality and furnish 
valuable descriptive information for researchers. Also useful is the recently 
completed survey by Ingraham and King (1965) of “outer fringe benefits” 
in colleges and universities. Whether judged by salaries paid, associated 
benefits, degrees held, academic rank, or social esteem, the academic man 
is relatively better off than he was a few years ago. Bernard’s (1964.) study, 
which arrays evidences from statistical, biograph 
mentary studies, competently correlates and inter, 
about women faculty members. 


Few investigations of the cha 


ical, and personal-docu- 
prets current knowledge 


es it is research and scholarly writing rather 
crowded out by other responsibilities. Steck- 


a seriously researched roblem. The difficulti 
by Gustad (1961). whe ond 


s ized. Diffe i a 
also partially account for the pi dite: fon Beene he e 
(1960), which is less optimisti p . 


their jobs, a few leave th fession 

EI e profession, 

probably because of motivational f, vell as tiita faniary pay or 
shift positions, as Stecklein 
^ op (1 3 ng attraction-retention study 
att e University of Minnesota. Sagen (1963), using Murray's “needs- 
press" framework in investigating faculty satisfactions, found that both 


and others 
ed in their pioneeri 
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rat oni Er position, a conclusion also supported by Allen 
"i eriam eei studies of prospective and current faculty members 
SFE, in pee em knowledge regarding their backgrounds and 
emp Ses € much less has been learned about their interests, out- 
€ de [^g reign Judging from the literature, little study has yet 
M o t oy aie as individuals or organized groups, in the aca- 
ce y wore ut promising hypotheses developed in these exploratory 
ce on should, if pursued, yield knowledge about faculty comparable 
ow available for college students. 


College Teaching* 


Unquestionably the finest single source for researchers interested in 


instructional problems is the Handbook of Research on Teaching (Gage, 
1963) , which provides excellent discussions of theories of teaching, designs 
for research, and measures of teaching and learning. Three decades of 
college instructional studies are incisively analyzed and interpreted in the 
concluding chapter, by McKeachie (1963). In another useful volume, 
Theories of Learning and Instruction (National Society for the Study of 
Education, 1964), current knowledge about learning is reviewed and its 
relationship to teaching discussed. Popularly written interpretations of 
recent studies, including some that do not reach regular publication chan- 
nels, are provided in the U.S. Office of Education’s “New Dimensions in 
Higher Education” series, edited by Hatch (1960-65). A second supple- 
ment prepared by Eells (1962) extended through 1961 his invaluable 
annotated bibliography, College Teachers and College Teaching, raising 
the total references covered since 1945 to 5,656. 


Concepts and Theories of Teaching 


teaching methods is that too often 
theoretical framework. Presumably 
ethod variables, relate learning 


. A persistent criticism of research on 
it has not been related to an adequate 


the latter would help to clarify teaching m s 
theories to the complexities of educational settings, and stimulate more 


comprehensive studies of college teaching. In line with current efforts to 
develop such a framework for the entire field of education, Smith (1961) 


* This review was undertaken while the junior author was supported in part by 
grants to the University of Minnesota, Center for Research in Human Learning, from 
the National Science Foundation (GS541), the National Institute of Child Health and 
Human Develonenant. (EPOLHD-011360]1-HDP), and the Graduate School of the 


University of Minnesota. 
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has re-examined the concept of teaching and suggested a fruitful model for 
studying “episodes” in the teaching cycle. 

The actual development of teaching theories remains in its infancy, not 
a surprising finding since such development must draw extensively from 
research and theories in the youthful behavioral sciences. In this connec- 
tion, Gage (1963, Chapter 3) recommended that more use be made of 
prominent learning theories, e.g. 
theories. Little application of the concept of identification has yet been 
reported, but conditioning theory has 
research on programed instr 
ing, Ausubel (1963) has se 
the teaching of verbal knowledge and supporte 


Stress upon the limitations of "discovery" ap 

Several paradigms for research Suggested in the Handbook of Research 
on Teaching (Gage, 1963) hold considerable promise for developing and 
testing such theoretical formulations, McKeachie (1961) and Siegel and 
Siegel (1964) have also argued persuasively for frameworks which relate 


sses also have wan 


ed. Independent study, which 
as one of “the gl 


” 
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Lecture and Discussion Methods. No research projects comparable 
to the 1957-59 Miami University study of large-group teaching procedures 
(Macomber and Siegel, 1960) have been reported recently, but scattered 
comparisons generally support its findings that neither the initial acquisi- 
tion nor the retention of learning is substantially affected by teaching 
method. Thus Leton (1961), in a limited comparison of lecture and group 
discussion in a course in child development, found no differences that 
could be ascribed to the procedures used. 

A few studies attempted to appraise the effect of s 
sessions for those which a teacher customarily leads. Based on studies in 
11 departments at the University of Colorado, Hovey, Gruber, and Terrell 
(1963) found that the achievement of self-directed groups. meeting with- 
out an instructor for most scheduled sessions, was slightly superior, 
both at the course’s conclusion and 10 months later, to that of conven- 
tionally taught students and that the experimental students were also more 
curious. Yet the control students typically gave more favorable course 
evaluations, suggesting that students, like instructors, tend to resist the 
unfamiliar. Other studies, confined typically to one or two courses, yielded 
similar findings. Scattered evidence also suggests that systematic training 
for student-led discussions improves these and general course achievement. 
But few studies have attempted to analyze the thought processes or verbal 
behavior of participants or even to identify major learning goals. It is 
not surprising that research findings under these circumstances are slim. 


ubstituting group-led 


Independent Study. Provisions for independent study, both within 
and beyond regular courses, appear to be increasing. Felder (1964), who 
examined practices in four-year colleges, found that two thirds of the 
responding institutions currently had such provisions, and less than one 
half of these were restricted to superior students. " 

Antioch's carefully executed appraisal of independent win EE 
by Churchill (1961), showed that students in seven first-leve n 
education courses who spent approximately one half of the Le on 
time in independent study or small group meetings did as wel "i D 
as satisfied with the instructors and courses involved as were : ae 
attended classes regularly. But all three methods produced liig ys = 
“learning resourcefulness” or the achievement of skills and attitudes tav 


ing continued independent study. 
In comparisons made in several 

study procedures in a beginning psychology course 

taught students did slightly better than the experi i 

classroom tests, but McKeachie and others (1960) Lee lvement in the 

students developed a greater interest and sense of invo 

subject matter. : . " 
ia cle are nein ig en Dr Chia 


ences, research on their learning value is ] 
309 


REVIEW or EDUCATIONAL RESEARCH Volume XXXV, No. 4 


student, faculty, and employer perceptions, Wilson and Lyons (1961) 
reported generally favorable conclusions from their appraisal of work- 
study plans in 17 colleges. Based on information obtained from American 
students and from educators in the host countries, Weidner (1962) sug- 
gested that the better student-abroad programs have a substantial impact in 
the areas of general cultural values, political and international attitudes, 
and personal maturity. Clearly there is fertile ground for research on 
these and other off-campus substitutes for conventional instruction. 


Instructional Technology and Media. A remarkable feature of re- 
cent research on instructional technology and media is the swift change in 
emphasis from televised to programed instruction. Lumsdaine and May 
(1965) provided an excellent guide to the extensive literature in this area, 
much of which exists in mimeographed technical reports. Many reports of 
instructional media research conducted under the National Defense Educa- 


tion Act, Title VII, have been abstracted under the directi Allen 
Hees, nder the direction of 


ctures and other teaching tech- 
» Schramm found 3 that favored 
ruction, and 84 that showed no 
itudes of college students were 


ended j 
type of viewer, P upon the subject, the teacher, and the 


In the Chicago City Junior Colle 
Chausow (1960), television studen 
well as students in a f. 


ge studies reported by Erickson and 
ts, even those viewing at home, did as 
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Another strategy, recommended by Lumsdaine and May (1965), empha- 
sizes the investigation of theoretically-relevant instructional variables. 

The past five years include almost the entire history of research on 
programed instruction. Of 190 studies reported by Schramm (1964), only 
25 were made prior to 1960. Although many of these studies involved 
elementary and high school students, a significant proportion related to 
college-level instruction. 

As in research on television, comparisons of programed instruction and 
conventional instruction have been popular. Despite the limitations of such 
comparisons, it is interesting to note that, of college-level comparisons 
reviewed, approximately one third favored programed instruction and none 
favored conventional instruction. 

Carpenter and Greenhill (1963), who directed several representative 
ersity, concluded that achievement in a 
th programed materials 
Ripple (1963) reported 
mentary psychol- 
hat instructional 


studies at Pennsylvania State Univ 
mathematics and an English course was as good wi 
as with conventional instruction. In a similar study, 
a significant advantage for programed approaches in ele 
ogy. A common finding in such comparative studies is t 
lime is reduced when programed materials are used. 

Another kind of comparative study is illustrated by Moore and Smith 

(1964), who tested various ways of presenting the Holland-Skinner pro- 
gram in psychology, such as using machines or programed texts, asking for 
Written responses or merely having the material read, employing free- 
response or multiple-choice formats, and giving or not giving the student 
knowledge of results. As has been common in these studies, there were 
no significant differences on the criterion test between the various presenta- 
tion groups. Even when they presented the programs to groups by film- 
strips or television, thereby eliminating the usual self-pacing features, Car- 
penter and Greenhill (1963) could find no difference in learning outcomes. 
On the other hand, evidence exists that some ways of using programs may 
be more effective than others. Dick (1963), for example, found an advan- 
tage when students studied programs in pairs rather than singly. 
Research on student reactions has been complicated by the great range 
in quality of programed materials and by the novelty of this approach. 
Robinson and Lerbinger (1963) reported ge 
à programed workbook which supplemente 
course. But Banta (1963), who summarized evaluati 
gram at different institutions, found that student reac 
cantly from college to college. 

Despite the hope of psychologists th 
would readily be solved by programe à i 
revealed by research in this field are as yet quite modest. Perhaps its major 
value has been to stimulate clear specification of objectives and continuing 
revision of instructional materials on the basis of student learning. For 
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tion varied signifi- 


ems of college teaching 


at some probl 
d instruction, its contributions as 
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some aspects of instruction at least, programs can save time for both 
instructor and student, with no apparent loss in learning. 

As reported by Lumsdaine and May (1965), research continues on the 
use of a variety of other audiovisual aids including films and audiotapes. 
In the college setting, particular emphasis has been given to evaluation of 
the language laboratory, which seems to offer definite advantages in foreign 
language programs stressing audiolingual skills. Scherer and Wertheimer 
(1964), who compared two years of conventional instruction in German 
with instruction using an audiolingual method and language laboratory 
facilities, found the latter produced superior speaking skills but inferior 
translating skills. 

The central contribution of research on instructional technology and 
media in this period has been to help television, programed instruction, 
and language laboratories gain acceptance with college educators. A further 
task is to clarify how each of these can best be used in college instruction. 


Student and Teacher Characteristics 


The general climate for learni 
student and instructor performa 
related this factor to the nature 
occasional studies have explor 
traits in influencing learning. 
the unexplained variance i 

A few investigators ( 
1964) , who have related 
teaching method, have n 
findings suggest intrigui 
important exploratory s 
learning, McKeachie (1 
(affiliation, need for po 
with teaching methods i 


ng in a given college unquestionably affects 
nce. However, no studies have systematically 
and quality of college instruction, and only 
ed the significance of student and teacher 


» even though these may account for much of 
n achievement. 


interacted to some degree 
learning. Thus students who showed 


possibilities, but in the period of this review only Corcoran (1961) and 
McKeachie, Isaacson, and Milholland (1964) examined specifically the 


teachers as seen by their students. 
scales, Corcoran found that good 
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[a ave rate high in surgency (buoyancy and enthusiasm), comention 
ees — and cyclothymia (concern for people). Similarly, 
: E meg saacson, and Milholland (1964) discovered that young teach- 
rs who were ranked high by their peers on surgency, culture, and emo- 
ee stability were the most likely to be viewed favorably by students. 
xi n igni trail-blazing study, Solomon, Bezdek, and Rosenberg (1963) 
Ha ie eight factors, derived from reports by students and trained ob- 
pe i mew tape recordings of teachers’ behavior, to students’ gains 
die ES a and in comprehension. Findings from this study 
o st, as do those reported by McKeachie, Isaacson, and Milholland 

'), that teaching styles do influence learning. But much more needs 
to be discovered about how teachers’ personal traits influence their choice 
of methods and students’ success and satisfactions in learning. 

In general, recent studies of teaching methods and materials have, like 
their predecessors, yielded no clear-cut evidence of the superiority of any 
one approach. This unquestionably reflects problems in definition, design, 
and instrumentation; it also testifies to the strong common influence exerted 
by tradition and the competition for grades. Since students seem to acquire 
certain types of knowledge quite well on their own, teachers may properly 
stress skills of inquiry, interpretation, and application. And certainly in- 
novations in teaching can be tried more boldly, without undue fear that 
standards of quality will be sacrificed. 


Next Steps in Research 


The major impression gained from a review of recent literature on 
college teachers and teaching is that research on these topics still claims 
far too little time, effort, and money. While a few excellent studies have 
emerged, educators lack carefully planned and cumulated research to guide 
decisions on staffing and instructional problems. Creative contributions 
made by isolated investigators need to be supplemented by campus-wide 
studies and by well mobilized regional and national efforts. 

The start made in formulating theories of teaching should be vigorously 
continued in the years ahead. As theories are refined and effective ways 
are evolved to test them in college settings. they should contribute greatly 
to the improvement of teaching. The college faculty member, as well as 
the researcher, needs comprehensive and empirically supported theories to 


guide his efforts. 

Progress charted recently in 
hope for the next decade of ins 
parisons made using ambiguously 


defining experimental variables also raises 
tructional research. Clearly, the many com- 
defined experimental and control methods 
have added little to our knowledge of teaching and should not be continued. 
The experimental application of theoretically derived variables [see Lums- 
daine and May (1965) and McKeachie, Isaacson, and Milholland (1964) J 
constitutes one promising approach to such sharpened definition. Another 
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attack, which Gage (1961) cogently supported and which is well illustrated 
in the Solomon, Bezdek, and Rosenberg (1963) study, involves process- 
descriptive studies of teaching aimed at finding out what actually transpires 
in the classroom or laboratory. Further work along both these lines should 
aid in setting up meaningful experiments and in developing better theories 
of teaching. 

Current concern for strengthening the subject content of college courses 
should also stimulate better studies of teaching. One reason why research 
on teaching has been rather sterile is that it has often been considered in 
the abstract, without reference to the particular subject content involved. 
As more knowledge is gained about the structure of individual disciplines, 
both teaching methods and ways of evaluatin 
to be rethought. 

Research patterns illustrated in some of the studies noted above also 
merit far wider application. More comprehensively planned studies will 
enable investigators to utilize a broad array of information on input, 
process, and output variables and to identify important interactions as well 
as main effects of the experimental variables. But, as Pullias (1963) coun- 
seled, preconceptions about acceptable methods of research should not 
restrict the investigator’s vision. A fresh 
promote excellence in teaching far more t 
such imagination. 


Better ways also need to be found to diffuse research findings and to 
mein trial of promising teaching methods and techniques. Since stu- 
ents as well as professors tend to resist change, established behaviors are 


g their effectiveness may need 


idea tested in a limited way may 
han a sophisticated study lacking 


eans for stimulating experimentation 


e, many faculty members will continue 
ped in an earlier time. 


Tenuous, across-the-board effort holds 
problems ough on today’s massive instructional 
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CHAPTER VI 


Noninstructional Services 


JAMES K. WINFREY and DANIEL D. FEDER 


In addition to a great increase in the number of articles published during 
the period covered by this REVIEW, increasing attention to research in this 
field is evidenced by the appearance of the Journal of College Student Per- 
sonnel, a quarterly publication of the American College Personnel Associa- 


iptive, close on the heels 
of differing programs. 
ent housing, where re- 
ped more rapidly than in almost any other 
ption of counseling (treated in the April 1963 
issue of the REviEW). As different ways of performing the same services 
are identified, a question inevitably following is whether one way is more 


effective than another, or whether different outcomes in terms of student 


learning, attitudes, and behavior result from the different methods of 
operation. 


Programs and Personnel 


universities, 
Scott ( 


1 
less than 2,000 enrollment with those 


tions. Smallness of institution and 
and contact with, students was not 
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there has been a steady growth in the establishment of service units headed 
by a chief student personnel officer in such institutions since World War II. 
The respondents reported a median of 20 semester hours of graduate study 
in the field: they spent a median of 75 percent of their time on actual 
student personnel functions; and virtually all reported directly to the presi- 
dent of the institution. Although each of the recognized student personnel 
functions was performed by some of the respondents, those most often 
performed were personal counseling, discipline, and responsibility for 
student records. The wide variations observed indicated that the functions 
performed and the roles of those performing them were not well differen- 
tiated in the small institutions. 

Smith (1961) surveyed the role and function of the chief student per- 
sonnel officers in institutions with enrollments over 10,000. 'The most fre- 
quently encountered professional title was dean of students. Almost all 


these officers were responsible for discipline, orientation, student govern- 
ment, fraternities and sororities, housing, financial aid, student counseling, 
ddition, two thirds of this 


foreign student programs, and activities. In a 
sample reported being responsible for testing, placement, health services, 
responsibility also for vet- 


and student unions. About one half reported 
erans affairs, security and traffic, and admissions. About one third reported 


supervision of special therapy clinics and food services. The respondents all 
anticipated that their positions would become increasingly administrative 
with less opportunity for personal contact with individual students and that 
they would strive for increased faculty assistance in certain phases of these 
noninstructional programs. It was predicted that increased emphasis would 
be placed on student housing and financial aids. 

Hill (1961) surveyed the available studies on the selection and training 
of student personnel workers to date, noting the high degree of self-selec- 
tion of people into student personnel positions without well defined training 
patterns. He cited the need for more accurate role definitions and research 
on effectiveness of training programs for specific functions. Clark (1964) 
reported the development of a scoring scale for student personnel workers 
using the Strong Vocational Interest Blank. Using a sample selected from 


the active male membership of the American College Personnel Associa- 
and reliability coefficients 


tion, he derived a scale whose high validity = 
lend confidence to its use. Similarities of interests to those of psychologists 
e pattern which could be 


and social workers were noted with a distinctive 
used to improve selection of persons interested in the student personne 


profession. 
Reeves and Arbuckle (1963) and Reiske (1962) reported separate in- 
vestigations of attitudinal differences among student personnel administra- 
tors and persons with counseling orientation. The former evidenced a 
higher degree of authoritarianism, higher persuasiveness, and less sympa- 
thetic and understanding attitudes than did counselors. The differences 
appeared to be fewer in number and less extreme as the amount of profes- 

sional preparation among the student personnel workers increased. 
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Part of the process of establishing role identification has led to studies 
of faculty and student perceptions of programs. Fitzgerald (1962) studied 
faculty perceptions about student personnel services at Michigan State 
University. In general, faculty ranked the importance of personnel func- 
tions in relation to the degree to which each seemed to support the aca- 
demic purposes of the institution. Even among tenured, long-service 
faculty members, however, there was marked lack of widespread knowl- 
edge, and this was more manifest among newer faculty members. Degree 
of involvement with student activities was related more to knowledge about 
student personnel functions than to length of time in association with 
the institution. 

Several other individual institutional studies (Atwater, 1961; Beckers, 
1961; Brantley, 1960; Harry, 1960) yielded data indicating that a major 
problem for the student personnel administrator was that of communi- 


à justment needs of freshmen was de- 
veloped by Griffin (1960). Five major bl y 

freshmen did not confer with their ides ee inl Tus 
when such assi 


habits, (c) Personal probl 


groups and the con- 
een the two experimental 
groups—adjustment to 
groups but not for the 


P deteriorated. 
measured by predicted-achieved rd 
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« 5 
and . nonpreviewers," those not participating. In each male sample more 
previewers than nonpreviewers joined fraternities. A significantly greater 
percentage of scholastic dropouts occurred among male liberal nts non- 
previewers. A significantly greater percentage of the female nonpreview 
group were placed on scholastic probation. A greater percentage of female 
previewers than nonpreviewers made the honor roll. 

Ivey (1963) evaluated three different programs of orientation by means 
2 questionnaire responses of freshmen in mid-October of each of the 
eas P diera No sex differences were found, and attitudes remained stable 

r the three years. Students responded best to social and informational 
activities. 

Tautfest (1961) used a stratified sample of Purdue University fresh- 
men to get reactions to topics and treatment in orientation week. Strong 
concentrated interests in academic aspects, becoming familiar with campus, 
handling finances, student organizations, and intramural sports were the 
topics most preferred. 

Smith (1964) used the Bell Adjustme 
of over 600 Negro students. The most fr 
economic deprivation, health, home problems, and an oversensitive atti- 
tude toward people. When these facts were added to data on scholastic 
aptitude and achievement differentials, they appeared to show the effects 
of these problem concerns on achievement. 

Morehead and Johnson (1964) compared two groups of male electrical 
engineering freshmen exposed to different faculty advising programs, one 
consisting of about 40 hours of group meetings and the other of individual 
conferences. Neither program involved intensive counseling or required 
special competencies beyond those that faculty normally possess. The group 
whose advisers used the more systematic approach (experimental) had 
a significantly higher grade point average in the spring semester, a greater 
proportion of high achievers, and a lower dropout rate. The control group’s 


dropouts were of higher ability levels. 


nt Inventory with a freshman class 
equent concerns were identified as 


Extraclass Activities 


nt leader opinions on campus life on 71 
ved that student interest in activities was 
ore apparent on large campuses, while 
with traditional patterns. Wide varia- 
student’s perception of personal value 
as a response to campus tradition. 
of university student organization leaders and non- 


d no relationship between values, aptitudes, and be- 
However, the student leaders appeared to 
and educational backgrounds (Barker, 


Reich (1961) surveyed stude 
campuses in 34 states. She obser 
generally declining. Changes were m 
smaller campuses showed less break 
tions in responses indicated that a 
in activities was more controlling than w 
, In a comparison 
joiners, tests showe 
liefs and college participation. 
come from higher socioeconomic 


1962). 
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Williamson (1960) resurveyed 121 of the 125 colleges and oo 
of all types and sizes, originally surveyed in 1957, to pa cm n e 
in payment of campus leaders. In general there was a dec phe Aa 
percent of the colleges paying in 1957 to 78 percent paying in a 
ministrators generally felt that student leadership should be voluntary, a 
an element in community training, and were concerned that service pe 
cepts were being lost. Student leaders felt that the press of time 
finances would require them to seek other jobs if payment could not be 
made for leadership positions in student activities. . 

Tillman (1961) studied the academic, extracurricular, and noncurricular 
activities of married and single undergraduate male students. Differences 
were found favoring the married student in grade point average, outside 
employment, and self-support. The single student attended more dances and 
movies, engaged in more individual Sports, and spent more time in the 


student union and visiting friends. In other activities no behavior pattern 
differences were observed. 


a marginal relationship. O'Donovan and Leila (1964) 
cant relationship between involvement in nonacademic activities and 
academic achievement of junior and senior business majors. 

Roskens (1960) reported a postcollege questionnaire follow-up on 


activities of University of Iowa graduates from 1922 to 1953. Substantial 
correlations were obtained b 


etween college and postcollege participation 
and leadership. 
Bloland and Hall (1962) developed and tested a 
and projecting space and facility needs for Support of various student 
activities by virtue of student involvement and the focus of the institution 
iti i the total educational program. 


found no signifi- 


method for analyzing 


in public, and with fe: 


Fraternities 


From 1950 to 1960, fraterni 
sota dropped from 12 to 8 
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percent of the available potential for member- 
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ship, despite marked growth in the student body. Bacig and Sgan (1962) 
offered several testable hypotheses to explain the decline: changing times 
and attitudes toward education; racial and religious discrimination: ex- 
ertion of negative influences on members; and continuation of pledging 
and initiation rituals which alienate members from the rest of the campus 
culture. 

Trumpe (1962) developed an instrument for evaluating college social 
fraternities with the help of 253 college deans on campuses where fra- 
ternities existed, national officers of 33 different fraternities, and 143 
fraternity chapter presidents. Psychophysical in nature, it correlated 0.458 
and 0.485, respectively, with rankings made by members of the inter- 
fraternity council and panhellenic board based on scholarship, campus 
leadership, intramural activities, and their personal concepts of what con- 
stitutes a well rounded fraternity. 

Matson (1963) studied the effects of place of residence upon academic 
achievement. The 30 fraternities at Indiana University were divided by 
the ratings of judges into three categories—high prestige, middle prestige, 
and low prestige—and were compared with dormitories and off-campus 
housing. A relationship existed between fraternity prestige and academic 
achievement, although this was partly due to the higher ability of members. 
[Crookston (1961) noted differences on the prestige axis resulting from 
selection of pledges and their treatment.] The average abilities of the five 
groups grew increasingly similar toward the senior year as the poor- 
ability students of the low prestige fraternity group and the off-campus 
group were weeded out. Fraternities with an average or better reputation 
and the residence hall environment seemed to produce the best atmosphere 
for academic achievement, whereas off-campus living and fraternities with 
low prestige seemed to have a negative effect. [Buckner (1961), Crook- 
ston (1960), and Prusok and Walsh (1964) found no differences in 
freshman-year grades by place of residence.] The dropout rate was higher 
for the off-campus group and the residence hall group than for any o 
the fraternity groups. In a different setting, Willingham (1962) reporte! 


a similar finding. 


Religious Activities 


A number of articles have appeared during the past five paea 
ing religious activities programs in higher education. Horer ( ! ae 
Carey (1962) found that in tax-supported institutions contro are place: 
under the dean of students, while in private institutions there are a num- 
ber of other arrangements. In general, the tax-supported —— ges 
vided more regulation of religious groups than did others. espite the 
fact that tax-supported institutions noted legal deir puer preventing 
overt budgetary supports, in common with other types be institutions t s 
provided meeting places and other support that effectively “oe the 
Place of such programs on the campus. Over 97 percent of the inter- 
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denominational schools reported formal courses in religious education: 
slightly more than 50 percent of tax-supported schools reported such 
courses. 

Ferman (1961) reported negligible impact of the college experience in 
changing religious values. He concluded that most students made fairly 
permanent decisions regarding their religious values before entering 
college. 


Housing 


Rork and D'Amico (1962) reported that most U.S. universities, teachers 
colleges, and liberal arts colleges now provide some housing facilities. 
Most public junior colleges do not provide housing. Married students are 
increasing in numbers on campus [Mueller (1960) reported a rise from 
21 percent in 1955 to an anticipated 25 percent in 1965], and they require 
approximately two and one-half times more living space than do single 
students. 7 

Crane (1962) reported responses from 39 colleges and universities of 


2,000 to 8,000 students which yielded a n ivi fe 
teristics of residence halls. ial cata cia 


The living-learning system at Michigan State University, where class- 


vithin the coeducational dormitory 
l Ha, b). Eighty percent of the faculty 
ents al i i 

coeducational residential hall itself, FN A ou reni T 
students both in and ant of the class- 
of instructors and advisers, a more 
, and better esprit de corps. A related 
students with the living-learning type 
h social aspects and increased contact 


n the number of counselors was greater, 
more personal and psychological problems. 

ectiveness of using student advisers in 
with freshmen in residence halls. Using 
ge, perseverance in college, and change 
no significant differences were found 


ES ncy to a greater degree than do 
grades of students living alone. The overall influence of Mea tende 
to be more often negative than positive, 
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Marquardt (1961) found resident advisers to have the most profes- 
sional expectations of their role, followed by resident assistants, with 
the least professional expectations being held by the students. 

Koile and Hays (1962) analyzed critical incidents of head resident 
behavior reported by students as retarding or enhancing emotional 
maturity. Enhancing incidents met Rogers’ three basic conditions for 
psychological growth: unconditional positive regard, empathic understand- 
ing, and congruence. The use of these incidents was proposed for in- 
service training programs to assist in understanding student behavior 
and in fostering student development. 

A special training program in study habits and skills for graduate 
students serving as residence hall counselors made little change in each 
counselor's perception of his adequacy but improved public relations 
with students (Hood, 1962). 

Horle and Gazda (1963) gave to men's residence-hall counselors in a 
wide variety of schools a broad definition of counseling to use as a model 
in determining whether their activities could be so classified. They found 
that much so-called counseling was being done by untrained individuals 
without relevant educational background and experience. The residence 
hall position was primarily being used to finance further education, 
usually not in counseling and guidance. 

To study differential perceptions of the role of the student dormitory 
counselor, Gonyea and Warman (1962) devised the Counselor Perception 
Blank and Counseling Appropriateness Check List as instruments for 
selection of student counselors. The instruments were validated on ex- 
perienced counselors and then applied to groups including administrators, 
inexperienced counselors, and dormitory residents. The ideal student 
counselor was characterized primarily by needs for nurturance and intra- 
ception (need to understand people), to a lesser degree by dominance 
and achievement, and least of all by autonomy and exhibitionism. Admin- 
istrators and head residents held perceptions of counselor functions that 
differed markedly from those of the dormitory residents. i : 

Sheldon (1962) reported an evaluation in the sixth year of a University 
of Illinois program to prepare a small group of mature women for student 
personnel work in residence halls. It was concluded that the program 
proved to be a source of adequately trained and qualified staff. They 
developed good counseling, group advising, and program development 
skills. In some ways they were better accepted by students than by 
the young staff. 

Although a number of larger colleges and universities have replaced 
housemothers in dormitories with guidance personnel, Otto (1961) con- 
sidered housemothers to be a neglected resource. A three-year five- 
institution study, which included interviews and group discussions with 
students and conferences with administrators and housemothers, led to 
the conclusion that the housemother, because of daily and continuous 
contact with the student, has a key position influencing student emotional 
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health, attitude, and adjustment toward college and determines the atmos- 
phere in the campus residence which, in turn, can affect the morale of 
the whole institution. 


As a selection procedure, Sheeder (1963) used staff in various roles 
to create life situations in which applicants for counseling positions had 


Prusok (1960) surveyed fresh: 
at the State University of Iowa. Finances were the primary reason for 
selecting off-campus housing, with a desire for independence next. These 
students tended to participate in student activities less and to use uni- 


versity special services less, seeking help instead from other students, 
instructors, and faculty advisers, A signi 


existed between the off-campus studen 
Gross (1961) studied 273 students at 


Student personnel workers involved 
students wi 


: nts and can be used to help foreign students 
and to devise programs to meet their needs, 


Financial Aid, Employment, and Placement 


Thistlethwaite (1959) studied 
college-going behavior of talente 
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characteristics which modulate the impact of scholarships. A sample 
of National Merit Scholarship winners was matched with a sample of 
near winners who received no aid on the basis of sex, total Scholastic 
Aptitude Test score. percentile rank in high school graduating class, 
and available family resources. The immediate effects of the award were 
increased attendance at college of first choice, colleges with small fresh- 
man classes, privately controlled colleges, and colleges with religious 
affiliation. Large stipends tended to influence attendance at high-cost 
schools and all-male or all-female schools. to promote more advanced 
training, and to improve chances of acceptance to colleges with selective 
admissions. 

Wright and Clark (1961) investigated the effects upon extracurricular 
Participation of academic endeavor to meet requirements for renewing 
scholarships. Students who met scholarship-renewing requirements be- 
longed to a greater number of well defined student organizations. Those 
failing to win renewal of their scholarships participated in a greater 
number of activities of “outside” organizations. 

Harrington (1964) evaluated an emergency student loan program. 
Although there was no correlation between grade point average and 
repayment experience, active participation in extracurricular activities in 
high school and college accompanied prompt repayment patterns. Per- 
Sonal interviews at the time of the loan and mailed reminders increased 
repayment rates. That students have a greater capacity for meeting fi- 
nancial needs than is supposed was indicated by the number who obtained 
funds from other sources when refused by the institution. 

Adams (1962) developed a system of classifying by title, code, and 

escription the on-campus part-time jobs performed by college students, 
with the primary purpose of attempting to determine their educational 
value. The analysis was done on jobs of 2,088 students based on 166 
job descriptions, 16 occupational groups, and 5 major occupational divi- 
sions, with the educational progress of the student paralleling the com- 
plexity of the job. 

In 1962 nearly three fourths of all college degrees awarded indicated 
that half a million employment candidates were serviced by college place- 
ment offices. Small colleges were found to have better success in getting 
early placement for their graduates than large colleges, and as many or 
more of their graduates continued for graduate work (Brooks, 1964). 

Stevens (1963) classified student clients as (a) ready to be placed in a 
specific job in a specific geographic area, (b) not ready to be placed, or 
(c) showing no conclusive evidence of readiness or unreadiness. From 
interviews she derived several promising new instruments: a Placement 
Readiness Scale, an Over-all Impression Readiness Rating Scale, and a 
Placement Prediction Scale. A Placement Success Scale was used as a 
Criterion measurement. Using as subjects 100 New York University Place. 
ment Service registrants seeking jobs in education, she found the Place. 
ment Readiness Scale to be more effective in predicting outcome than was 
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a counselor's intuitive judgment, and success in job placement appeared 
to be associated with the level of placement readiness. 

Brown (1961) utilized case studies to assess the practices of 100 stu- 
dent personnel officers at as many schools with regard to revealing student 
record information to prospective employers. The wide discrepancies sug- 
gested the need for development of policy guidelines governing use of 
such records. 

About 78 percent of placement directors in a sample of 184 schools 
opposed special fees for this college service (Eckberg, 1963). Of those 
in favor of fees, 78 percent represented private schools. Of the 184 schools, 


39, or 21 percent, actually charged fees. Of these, 11 favored dropping 
them. 


Health Services 


Students who were fre 
those who made average or small use of the 


opment and in development of interpersonal 

relations, manifested hostile-dependent conflicts in relation to their parents, 

and described homes as places of discord and strife (Wharton, 1962). 
Kirk (1964) developed an i 


x i g orty-eight health service directors placed 247 
discrete items into 31 categories, with each category assigned 100 pos- 
sible points for a total of 3,100 points. Tried by 13 institutions, the 
instrument was found to þe valid, reliable, and suitable for evaluating 
any college or university health service, either by its own director or by 
a competent outside examiner. 

Whittington (1961) noted that only one half of all colleges have any 


program and only 14 percent have facilities 


and universities, 18 of which also 


e zel tutions charged fees; 34 provided free 
services. The remaining 12 charged according to the nature and extent 
of each case. 
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Discipline 


The need for long-range follow-up evaluations of counseling-oriented 
discipline still exists. 

Prusok (1961, 1963) investigated the attitudes of students, parents of 
students, and student personnel workers toward 76 university disciplinary 
Situations. He found student personnel workers most punitive in their 
attitudes, parents next, and students least punitive. Inasmuch as the focus 
of the study was only on punitive action, there was a builtin bias in 
attitudes. An evaluation of student mores by Hodinko (1964) revealed that 
high-ranking offenses in the eyes of undergraduate students were theft 
for material value, cheating involving collusion, and sexual promiscuity. 
Low-ranking offenses were theft for novelty value, possession of beer on 
campus, and drinking by a minor. He observed that men were more 
critical of acts involving cheating, property destruction, or theft than were 
women and were more approving of misuse of alcoholic beverages and 
sexual promiscuity. Older students took a more serious view of prankish 
behavior, and fraternity men generally expressed milder censure in all 
Categories than did independent students. Sorority women, however, re- 
acted with more strictness in all categories than did independent women. 

Sillers and Feder (1964) used a measuring device which permitted 
inference of attitudes from personal evaluations of factual situations 
and followed up with interviews to appraise the attitudes of college presi- 
dents, academic deans, and student personnel deans in nine public insti- 
tutions of higher education toward disciplinary problems. Extensive data 
Concerning each student case were supplied. In general, the data were 
used more on a post hoc basis for justification of attitude and action 
than for understanding the individual. A unique point of view was not 
found among the student personnel workers. There was more similarity 
of attitude among individuals within a given institution than among 
individuals of a given professional type in the several institutions. 


Research and Evaluation Functions 


Commenting on the lack of available knowledge, Williamson (1964) 
noted that much contemporary student personnel work is not grounded in 
basic research. The built-in improvement of practice and technique through 
Continuing research on basic processes must be accompanied by more re- 
Search on the basic phenomena of the college adolescent and his full 
development with intensified research on the institutional settings in which 
such development takes place. 

Robinson and Brown (1961) found 77 studies focused on students 
and student personnel programs being conducted by 13 agencies and 
centers for the study of higher education. Of highest frequency were 
studies of student characteristics, attendance, retention and withdrawal 
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of college students, and student personnel programs. Berrian (1962) 
examined the Education Index for the years 1910 through 1959 for 
research articles on student personnel work in higher education. She 
noted a gradual increase in the number of articles, a slight increase in 
each decade in the proportion of pages on student personnel work. and 
the greatest concern of writers for educational adjustment, college per- 
sonnel programs, tests and measurements, and adjustment factors. More 
articles dealt with selection for admission and counseling than with 
other services. Descriptive research was most frequent, and psychological 
tests and questionnaires were the most used research tools. In the last 
decade much more attention was paid to sampling methods and statistical 
procedures. 

Fisher and Roth (1961) pointed out major restrictions and limitations 
of current research in the field: single factors are most often investigated ; 
single instruments are used: and single dimensions are measured. They 
noted an absence of postulates and hypothetical systems in m 


any research 
programs and stressed the 


interdependent functions and criteria of 
adequacy as the other two aspects of adequate research structure. They 
suggested a pattern for the complex research methodology required by 
the complex operations of educational functions. 

Crookston and Blaesser (1962) 
in various aspects of college life w 
They proposed force-field analysis 


analyzed some of the major changes 
hich seem to be continuing to change. 


for the study of the structure and proc- 
ess of change, presented diagrams of situations illustrating the driving and 


restraining forces operating against the quasi-stationary equilibrium, and 


proposed a research and an operating structure for the systematic and 
experimental study of change. 


Williamson (1961) offered a number 
the possibility of new con 
student personnel staff as w 
in the area of student exp 


In the significant theoretical articles of the period being reviewed. 
there is a strong thrust toward more basic research and a structiming of 
research designs which will take account of the complex variables operat- 
ing in the college student's life, With these improved approaches there 


is concomitant development of studies of Systems, their operational ef- 
fectiveness, and their differential end products. 


of testable hypotheses concerning 
structions in student relationships with the 
ell as with other agencies within an institution 
ression on social issues. 
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CHAPTER VII 


Professional Education 


STEPHEN ABRAHAMSON 


This separate section treating education for the professions has been 
included in the Review largely because of the sharp increase in research 
activity. Unfortunately, the very term profession can include different 
occupations depending on the definition given it. Blauch (1960) included 
“those forms of specialized education for which schools or colleges are 
maintained in the university.” Limiting the field further for the present 
chapter to include only education for medicine, dentistry, nursing, law. 
social work, and engineering, the author still reviewed more than 400 
Separate research reports. l 

l Since research for the most part is in single fields of professional educa- 
tion, a simple approach to organization would be grouping according to 
profession. However, treating common topics across profession lines af- 
fords a better opportunity to synthesize and present the common elements 
and problems of education for the professions. Thus, this chapter includes 
sections on students, curriculum development, techniques and materials of 
instruction, and faculty. 


Students 


Studies in the six professional education areas included in this chapter 
tend to fall into three major categories: prediction of success in the pro- 
fessional school, characteristics of students in the school, and follow-up 
of graduates and dropouts. 


Prediction 


In almost all studies professional school course grades were used as 
criteria for success. The prediction variables included aptitude tests, gen- 
eral intelligence measures, preprofessional academic achievement, and 
psychological traits. Some studies attempted to find the best single or 
multiple predictor. Schumacher (1963) attempted to present “a relatively 
simple, unweighted formula for predicting success in medical school.” He 
found the usual variables related to professional school achievement: age, 
undergraduate grades, and personal qualities noted in the interview. Aca- 
demic ability tended to be less important in clinical studies (i.e., student 
activities associated with direct experience in the profession). Gough 
(1963), using Pearson correlations, compared premedical college grades 
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and scores on the Medical College Admissions Test (MCAT) with medical 
School grade point averages and found many statistically significant 
coefficients. However, the highest were less than 0.5 and tended to be de- 
scriptive of the relationship between college science grades and first-year 
medical school grades and between scores on the MCAT science subtest 
and first-year medical school grades. Correlations decreased in magni- 
tude and significance when the criterion variables were used later as 
measures of achievement. Hood (1963) included 22 variables and found 
predental school academic grades the best single predictor. When he com- 
bined predictors, the best multiple coefficient was 0.70. Concern with poten- 
tial errors in screening was voiced by Willingham (1964), who found that 
although freshman chemistry and mathematics grades were the best indi- 
cators of subsequent success in technical curricula, the relationship was 
"moderate at best." Similar concern could be voiced by others, since 
correlations tend to be low although significant. Derevere (1961) found 
correlations between 0.26 and 0.39 when "chalk carving" and “object 
visualization” were compared with first-year and fourth-year grades in 
operative dentistry. Johnson (1962) also found low 
was 0.28) between selected a. 
grades in medical school a 
promise than correlations as 
off approach is better than an 
(1962) also expressed doubt, 
good medical students,” but 


school. This seems to contradict 
er a student’s desire to become 


ut the poorer his adjustment. Eichhorn 
and Kallas (1962) found a significant relationship, using chi square. 


when they studied dropping out of engineering school and such factors as 
mother’s education, siblings’ education, financing from parents. and 
fraternity membership. Finally, Martin (1962) found success and failure 


in nursing school significantly related to scores on achievement and in- 
telligence tests. 


another generalization of his: the strong 
a lawyer, the higher his grades b 7 


Student Characteristics 


During the five years under review 
studénts were published. Becker and oth 
and insightful sociological study of U 
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dents, describing major changes in the students during a four-year pro- 
gram. Horowitz (1964) provided documentation of the educational gains 
from curricülum revision at the Western Reserve University School of 
Medicine with his 20 interesting, informative, and competently done case 
studies of students in that program. 

Literature in professional education includes many studies of student 
attitudes. Despite the usual problems of attitude measurement, many in- 
vestigators managed to use existing scales or to construct their own. 
Rosinski (1963) surveyed medical student attitudes with a specially de- 
signed card-sort attitude scale and discovered positive changes in attitude 
toward medicine over the four years of medical school but significantly 
negative changes toward education and research. Stiff and Phipps (1964) 
tested attitudes of dental students toward chronically ill, aged, and dis- 
abled persons, using two scales: favorable change in attitudes before and 
after clinical dental experience. Varley (1963) described social work 
values of beginning and graduating students on four scales of high re- 
liability: equal rights, service, psychodynamic-mindedness, and universal. 
ism. She found no differences when both t-test and analysis of variance 
were used. 

Levitt, Lubin, and Zuckerman (1962) administered the Edwards Per- 
sonal Preference Schedule to 1,000 women including 212 student nurses 
and 167 graduate nurses. They found an increase in social feminine need 
as the college woman became a student nurse. As the student nurse be- 
came a graduate nurse, there was an accompanying decrease in nurturance 
and an increase in deference, order, and endurance. 


Enrollment in Professional Schools 


Enrollment trends in engineering education were described by Kriegel 
(1963), who concluded that the number of advanced-degree candidates 
has been rapidly rising, and that while the demand for advanced-degree 
engineers is expected to continue, it seems quite dependent upon the 
federal government's attitude. Social work enrollment trends were explored 
by Pins (1963) in a very thorough survey in which he obtained 2,900 
responses to 2,964 questionnaires administered to full-time first-year stu- 
dents in the fall of 1960, and by Ross and Lichtenberg (1963), who re- 
viewed the period from 1950 to 1960. Annual reviews of medical school 
enrollment figures appeared in the Journal of Medical Education (Johnson, 
1964; Hutchins, 1963a, b; and Hutchins and Gee, 1961, 1962). 


Follow-Up Studies 


Stewart and Pennell (1961) conducted a survey of all U.S. medical 
school graduates living in 1959, examining such things as the nuinber 
of graduates per school, the number in private practice, the number 
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of full-time specialists, and the physicians’ locations. Schumacher (1964) 
attempted to analyze choice of medical specialty according to marital 
status, medical school attended, size of city in which the graduates grew 
up, scholastic aptitude, certain values, and certain personality traits. Bristol 
(1962) asked graduating seniors in engineering to complete a question- 
naire giving reasons for job acceptance. The five reasons given most 
frequently were type of work involved, location, salary, opportunities for 
advancement, and opportunities for advanced-degree study. 

Dropouts are a serious and costly problem in professional education. 
Larsen (1962) studied students in the general nursing program at the 
University of Buffalo and found that discouraging factors include costs, 
credits required for the degree, the five-year time limit, and the field 
experience requirement. Kibrick (1963) found the following factors 
related to dropping out: incomplete knowled 
of authority and routine, high independe 
gerial home background. 


ge of the program, resentment 
nce, ambivalence, and mana- 


Curriculum Development 


ounds with discussions of scope 
and experimental programs. Research, 
reports of curriculum revision. Medical 
d since World War II by large-scale 


stitutions. These are reviewed precisely 
me M dr Lee (1962), who describes nine "experiments." bar 
most notable of these is that of West iversi pte 
ie by Hon (1962). estern Reserve University, repo 


Experimental Courses 


The kinds of courses reported on do not lend themselves to true 
n à special 


by virtue of research desi 


ion with the use of fairly sophisticated 
ker (1962) reported on an experi- 


kc nuu education for engineering students and 
reported a "feeling" that the course provides broad background in an 


important area. Elmer (1962), Teporting on experimental field placement 
of social work students, provided no data but concluded that “students . - - 
evidenced heightened knowledge of . . , principles.” 

With a reasonable research design involving five 10-student groups 
(two separate discussion Broups, two separate discussion groups with 
clinical experience, and one group using the “current method”), Schield 
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(1962) explored the “merits of an experimental course in dental technics” 
and reported a significant (p=0.06) advantage to the discussion-and-clinic 
experience. Shatin and Nenno (1963) used a questionnaire to evaluate 
Seton Hall’s new 68-hour teaching program for first-year medical students. 
while Aldrich and Bernhardt (1963) used a “problems cases test,” con- 
sisting of seven vignettes and open-ended questions, in assessing the worth 
of a new program in teaching psychiatry to medical students. Results 
showed no significant advantage of the newer course. Hand (1964) 
studied progress in technical writing for engineering students and con- 
cluded that the most important areas of growth were brought about by 
Practice. Finally, Hayes and Swenson (1963), in a reasonably well. 
designed research study of the effects of a three-month educational ex- 
perience in psychiatric nursing, found no change in adjustment scores or 
in self-concept. 


General Surveys 


Nelson (1960) investigated trends in nursing curriculum change 
through a questionnaire to 30 schools of nursing and found the major 
trend “a process of integration which places the material in the static 
framework" of specialty nursing care. Nursing Outlook (1961) reported 
a survey of 381 diploma programs in the North Atlantic region and com- 
Pared accredited schools with nonaccredited schools. 

Rosinski and Blanton (1962) attempted to list, categorize, and store 
9n data-processing cards the entire content of a medical school curriculum. 
They had each department of the Medical College of Virginia report its 
content, the educational objectives involved in each "bit" (i.e., informa- 
tion, understanding, attitudes, skills), and the instructional techniques 
employed. Through this approach they were able to identify blind duplica- 
tions of subject matter coverage. 


Course Content 


Literature on professional education has been concerned with what is 
taught and what should be taught in specific familiar courses. Tager 
(1964) surveyed the specific lecture content of 32 microbiology depart- 
ments in U.S. medical schools. Neidle and Hampel (1963), after return 
of a rather lengthy questionnaire from 40 of 53 schools contacted, cata- 
loged the instruments used in laboratory instruction, the visual aids 
employed, and the problems encountered in teaching physiology in dental 
schools. Budowsky and others (1961) used a questionnaire to study 
methods of teaching oral roentgenology in dental schools. They received 
responses from 36 schools but made no mention of what the percentage 
response was. South (1961) reported on a survey of tuberculosis educa- 
tion in 1,099 schools of nursing but did not give percentage responses 
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either. Such surveys can be quite useful to educators involved in curricu- 
lum planning and should not therefore be disparaged, despite the resultant 
lists or tabulations of practice that often seem to provide no insights or 
hypotheses for professional education. 


Techniques and Materials of Instruction 


Those who are trying for the first time to seek answers to problems 
in education often follow a developmental pattern of research. It is not 
unusual to find an early preoccupation with the effectiveness of familiar 
instruction techniques, particularly the lecture, small-group discussion, 
demonstration, and laboratory. In general, when these studies are designed 
well and the tests are rigorous, results tend to show no difference or a slight 
edge to the less formal of the techniques under study. 

Netsky, Banghart, and Hain (1964) found no significant differences 

soe D 
when they divided 74 students into two groups (alphabetically ) and 
treated one with seven lectures and a two-hour laboratory session after 
each and the other with seven one-hour seminars and a two-hour labora- 


ups as compared to lectures. 
ation of the use of low-cost motion 


» therefore, to the less-experienced teacher. 


of an experimental method of teaching hematology in 13 schools. Ginther 


« 
x M results tend to establish method as an important main effect" 
and the amount of time scheduled . :- Was an important main effect.” 


^ study by Higdon, Erdle, and Quanbeck (1962) tested the effect 


in achievement and suggeste 
beneficial. 
Testing and Grading 


The studies reported 
education for the profe 
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of knowledge, development of attitudes, and synthesis of these into an 
integrated performance, scholars in this field tend to be concerned with 
the design of new techniques and the testing of old. 

Boyle (1964) attempted to validate short-answer achievement tests 
in engineering. He explained a method for computing index values for 
determination of advantages and disadvantages of short-answer tests. 
Abrahamson (1964) described a routine test of interrater reliability in 
which five pairs of instructors were required to read and grade essay 
examination questions. The readers disagreed as often as they agreed 
about whether the student’s response contained the essential material, 
whether the student displayed an adequate understanding, and just how 
good the total response was. 

Medical education, perhaps more than any other professional education, 
seems to have been concerned with measurement of clinical performance. 
Liske, Ort, and Ford (1964) reported the development of a clinical per- 
formance rating scale in which there was only limited interrater agree- 
ment. Cline (1961) described the development of a film-and-test technique. 
He presented tests of the relationships between performance on a test 
accompanying a sound film, grades in internal medicine, and MCAT 
scores. The highest single correlation found was 0.32 between scores in 
the first film test and MCAT science subtest scores. The development of 
tests of diagnostic skills was summarized by Rimoldi (1961). 

Balabanian (1964) reported on an experiment to reduce tensions re- 
sulting from grading by introducing a contract system between student 
and teacher. He concluded that such contract assignments (of varying 
amounts for A, B, and C grades) may be a good substitute for traditional 
grading practices. Palmer (1962), also looking at grading, had nursing 
students perform an anecdotal self-evaluation, applying the same criteria 
as the instructors. After a two-year test with two classes, Palmer con- 
cluded that self-evaluation was reliable, bringing personal satisfaction 
and self-insight to students. An incidental benefit was the development of 
more trust between teachers and students. 


Programed Instruction 


Research tended to show a slightly greater informational gain with the 
use of programed materials. None of the research discussed cost, and 
almost none concerned retention, both of which need investigation. 

Using a good experimental design, Green, Weiss, and Nice (1962) 
tested the use of a programed text in a medical school course. The results 
favored programed instruction; interestingly enough, greater efficiency 
was found for all students, experimental and control alike. Cheris and 
Cheris (1964) indicated superiority of programed instruction for 46 
third-year students whom they divided into two groups, matched according 
to grade point average, for teaching radiology in a medical school. They 
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also reported, however, that “more efficient learning results from a text 
a. 

a 9 (1963) tested programed instruction in dental e 
but did not report the use of any statistical test of significance. He oai 
cluded that “self-instructional exercises are slightly more effective a 
‘traditional presentation." Kelso (1961) showed the pers. E 
programing in law school. He divided 53 third-year law students B ? 
experimental and control groups, giving the former 82 pages of m 
mediate response” program instead of classes. He mentioned no test o 
significance but did report the two means. acis 

The popular interest in programed instruction is demonstrated "es d 
way some results are reported. Craytor and Lysaught (1961) divi 
81 nursing students into an experimental group using a Skinner-type 
program of 221 small steps and a control group with “normal class prem 
tation.” They reported that students “learned as well by programe! 
methods as by normal instruction.” O’Brien and Ryge (1963) used pro- 
gramed learning with dental hygiene students, and while they reported no 


difference between treatments, they did state that “programed instruction 
is as effective as the lecture.” 


Use of Television 


Research into the use of te 
reveals “no difference” after 
the extent of television use. R 


levision in professional education generally 
experiment. There were some surveys O 

omano (1964) asked all 48 dental schools 
whether they were using television in some form and followed up his 
Survey by visiting 20 schools that Were using television; he described 
each in detail. 


Grossman, Ship, and Romano (1961) re 
which the 


ported on a three-year study 
third-year class w 


as divided randomly each year into four 
*5: in one group, seven or eight students 
; in another, 30 students were in the studio, 
ing a television monitor; in a third, 30 stu- 
m with two screens where there was limited 
treatment included no demonstration. No 
and others (1962) , 


in 


communication; the fourth 
differences were noted. Grant 


using a good research 
design, divided a sophomore 


class in dentistry into four groups: in ones 
students watched television monitors and were tested immediately: 1” 


another, students watched monitors and were tested two days later: 19 
the third group, lectures and slides were followed by immediate testing: 
and in the fourth, lectures and slides were followed by testing two 


days later. There were no differences in achievement between the groups 
using television and those using lectures and slides. 


Staub and Merrill (1963) found no differences in scientific reasoning 
between a group who watched a television demonstration of laboratory 
work and another group who performed in the laboratory. Westley an 
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Hornback (1964) found no significant differences in a study involving 
56 student nurses. The experimental group received videotaped lessons 
in three basic nursing units of instruction, and the control group used 
“normal classroom procedures.” A difference was noted in a study by 
Lancaster and Erskine (1962) which involved 1,332 freshman engineering 
students divided randomly into three categories: one for small-group 
instruction in groups ranging from 15 to 22, a second for large-group 
classes ranging from 58 to 67, and a third for televised instruction for 
groups of 102 to 108. The large groups did significantly better in analytic 
geometry than the other two treatment groups, while the television groups 
did better—but not significantly better—than the small groups. 


Faculty 


In medical education, more than in other areas of professional edu- 
cation, there has been an increase in research on improvement of teaching. 
One major impetus was the University of Buffalo Project in Medical 
Education, which began with a test of the hypothesis that medical school 
teachers might profit from seminars in education (Abrahamson, 1960). 
The Project then tested the hypothesis that intensive summer workshops 
might improve instruction, primarily through a change in attitudes 
(Rosinski and Miller, 1962b). Faculty teaching practices and attitudes 
were studied in seven different medical schools. Jason (1962) designed 
a series of rating scales with a high interrater reliability and observed 
teaching practices in seven medical schools, using teams of six observers 
in each—three medical educators and three educationists. An accompany- 
ing study reported by Rosinski and Miller (1962a) tested faculty atti- 
tudes in the same seven medical schools with a series of attitude scales 
developed by Rosinski. 


Summary 


Finding the research in professional education was an arduous task 
because of the diversity of sources and the quantity of material to be 
covered. Perhaps more difficult was the task of choosing studies to be 
included. The research is increasing astonishingly in quantity and quite 
steadily in quality as educational research expertise is drawn into the 
area of investigation. 

Research tends to be descriptive, except for that on instructional tech- 
niques and materials in which experimental investigation is under way. 
What is needed is further development of cooperative effort or at least 
common adherence to guidelines which will help yield comparable data. 
Obviously, a theoretical framework is not available; its beginning would 
be a major contribution to research effort. 
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CHAPTER VIII 


Organization and Administration, 
Finance and Facilities 


JAMES I. DOI 


Prior to 1960, research on college and university organization meant the 
identification and analysis of the formal structural aspects of an institu- 
tion. A convenient way to depict organization was to draw a chart showing 
the various administrative officers, academic divisions, and departments. 
The same chart could show line and staff relationships, spans of control, 
official channels of communication, and formal patterns of subordinance, 
superordinance, and coordinance. Research on administration focused on 
the duties and responsibilities of officers, on administrative policies and 
practices, and on a wide array of problems under such headings as the 
governing board, student government, and faculty-administrative relation- 
ships. 

Such research often succeeded in identifying common practices and led 
to the formulation of principles of organization and administration and 
recommendations on policies and practices. These principles, recommen- 
dations, and information on common practices did much to shape institu- 
tions of higher education as we know them today. They served and con- 
tinue to serve as working tools for the administrator. 

The study by Ayers and Russel (1962) dealing with the internal struc- 
ture of colleges and universities is typical of this traditional research. The 
basic data were drawn from the current organization charts of 608 institu- 
tions. The major limitations of this study, which the authors recognized, 
are the same as those of similar studies that preceded it: it assumes that 
position title and list of duties accurately describe what an officer actually 
does; it assumes that the patterns of communication and relationships 
depicted by an organization chart are the actual patterns by which the 
members of the institution relate; and it contributes to an oversimplified, 
misleading view of the nature of authority and decision making in an 
academic organization. 

In recent years, especially since 1960, the term organization has come 
lo take on different connotations in research literature on higher educa- 
tion, and the focuses of research on administration also have changed. 
The differences in both instances support the observation that a new era 
of research on the organization and administration of higher education 
is in the making. The salient features of this new era, at least in this 


early stage, appear to be the following: 
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l. Interest in the applicability of theories of formal organization to 
higher education institutions and in the development of a theory of 
college organization, ; 

2. Interest in both the formal and informal aspects of college organiza- 
tion, 

3. Interest in the application of communication and decision-making 
theories and models to the governing of colleges and universities, 

4. Accelerated diffusion of the terminology and frames of reference 
of psychology, sociology, political science, public administration, and 
business administration to include higher education. 


To identify publications in the several disciplines that are relevant to 
research on college organization and administration is beyond the scope 
of this chapter. However, a few such works will be cited here simply be- 
cause it is virtually impossible to discuss research on college organiza- 
tion and administration without reference to them. For a comprehensive 
review, the reader is referred to the October 1964 issue of the Review, in 
which Lipham (Chapter IV) and Briner and Campbell (Chapter VII) 
commented on works from the behavio 
they regarded as relevant to the study o 
cational organizations and to the develo 
tion. 

This chapter will confine its attention 


directly with college organization and 
cited are more 


tive studies rath 
theory. 


ral and political sciences which 
f administrative behavior in edu- 
pment of a science of administra- 


primarily to selected works dealing 
administration. Many of the works 
provocative than empirical, and the empirical are descrip- 
er than tests of hypotheses derived from or directed toward 


Organization 


tion of the informal aspects of an 
(1962), the formal and 


C UBBes . tegory scheme for classifying oT 
ganizations by function in society: or 


mim ganizations oriented to economic 
production, organizations oriented to 


; i political goals, integrative organiza- 
tions, and E c organizations. Pattern-maintenance OT- 
ganizations are characterized by either cultural, educational, or expres- 


sive functions. Blau and Scott (1962) , using prime beneficiary as their 
criterion, classified formal organizations into mutual-benefit associations. 
business concerns, service organizations, and commonweal organizations. 
In this scheme the university is a service organization. Etzioni (1961); 
using compliance as a comparative base, grouped organizations into those 
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that are predominantly coercive, utilitarian, normative, and those that 
are dual-structured. Colleges and universities are predominantly norma- 
tive, in that compliance rests on internalization of directives accepted as 
legitimate. Leadership, rituals, and manipulation of social and prestige 
symbols are among the more important techniques of control in norma- 
tive organizations. 

The term bureaucracy is invariably used in discussions of formal or- 
ganizations, and faculty members and editors of student newspapers use 
the term freely as a synonym for university. Blau and Scott (1962) noted 
that a formal organization possesses some sort of administrative ma- 
chinery. In sociology, bureaucracy is a term used neutrally to refer to 
the administrative aspects of an organization. Their definition of bureauc- 
racy is particularly interesting, since it is pertinent to much being written 
about contemporary college organizations. They defined bureaucratization 
as effort devoted to maintaining the organization rather than to directly 
achieving its objectives. 

Anderson (1963) gave a succinct summary of the major characteristics 
of bureaucracy, collegiality, and community and speculated on their 
validity as bases for understanding colleges and universities. Regarding 
business affairs, student personnel services, and other nonacademic ad- 
ministrative organizations within a university as bureaucratic, he con- 
centrated his observations on the collective faculty. He noted that collegi- 
ality is not democracy in administration and suggested a test of the 
hypothesis that a collegium is inevitably an oligarchy. 

In discussing community, Anderson strongly doubted that colleges and 
universities can be demonstrated to be communities. He differentiated 
between what they ought to be and what they are and suggested that the 
concept of community is important in understanding faculty and adminis- 
trative behavior. Many faculty members are guided by the belief that 
colleges and universities ought to be communities, and administrators may 
find it advantageous to operate as if such organizations were communities. 

Clark (1961) identified three conflicting concepts of authority in col- 
leges and universities: public trust as represented by a lay governing 
board, bureaucratic authority as represented by the administration, and 
colleague authority as represented by the faculty. The latter two he saw 
as increasing in strength. In a more recent treatise, Clark (1963) pointed 
to four trends in the social organization of the campus: unitary to com- 
posite or federal structure, single to multiple value systems, nonprofessional 
to professional work, and consensus to bureaucratic coordination. He sug- 
gested that the notion of community should be abandoned, since genuine 
community is no longer with us. 

Millett (1962) viewed colleges and universities as organized on the 
principle not of hierarchy of authority but of community of authority. 
Power is shared by faculty, students, alumni, and administration; and 
Coordination is achieved through consensus. Millett saw colleges and uni- 
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versities as so different from governmental and business entities that 
ideas on organization drawn from studies of the latter give no help in 
understanding the former. Corson (1960), however, maintained that 
colleges and universities are like other organizations in some important 
respects though different in others. . 

Goodman (1962) presented a proposal for a college truly committed 
to teaching and learning and, as a community of scholars, entirely free 
of external control, administration, and bureaucratic machinery. A con- 
trasting concept is Kerr's (1963) “multiversity,” not one community but 
many. Presthus (1962) declared that structure and control have overcome 
spirit and purpose in our universities, and Gross (1963) detected serious 
disparities between goals and structural arrangements, 

Other provocative works with implications for theory and research can 
be cited, but these should be sufficient to indicate that there does not yet 
exist a coherent theory of collegiate organization 
might derive hypotheses for investigations. McConnell (1963) noted this 
lack, and Kimball (1962-63) called for greater researc 
higher education, particularly on organizational theory. 


By way of actual research, Haas and Collen (1963) focused on varia- 


Measurement and Description 


Descriptions of colleges and universities 
journalistic in tone, to attempts at more 
and from those focused on individual sch 


ignificant is that it appears 
Y comprehensive sociological analysis of how 
various factors affected the evolution of a college organization. 


More typical of studies describing college organizations are Mayhew’s 
(1962) work on the liberal arts college, Carlson’s (1962) treatise on the 
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municipal university, and Wicke's (1961) book on the church-related 
college. Each identified the salient features of a particular type or group 
of institutions and discussed problems common to them. Quantitative data 
included are usually of the kind gathered by federal and accrediting 
agencies. In describing church-related colleges, Wicke made effective use 
of survey data, such as number and credit hours of courses offered, student- 
faculty ratios, average salaries, proportion of faculty with doctor's degrees, 
and income and expenditure patterns. 

A relatively new development in efforts to describe and measure colleges 
and universities is the sociopsychological study of campus environment. 
Literature pertinent to such research is reviewed in Chapter II of this 
issue. 

Barton's (1961) suggested classification of organizational variables and 
his list of empirical measures of college characteristics may well form 
the basis for more comprehensive and meaningful descriptive analysis of 
college organizations. The list includes not only demographic, fiscal, and 
Prognan data but also data on social structure, attitudes, and role be- 
havior. 


Administration 
Governance, Decision Making, and Management 


The term administration is still used in higher education, but its im- 
portance is somewhat obscured these days by at least three relatives— 
governance, decision making, and management. Hungate (1964), for ex- 
ample, declared that the term management has broader meaning than 
administration since the latter is sometimes interpreted to exclude policy 
formation and coordinated planning. For Corson (1960), governance is 
the process or art by which various people associated with a college insti- 
tution establish and carry out rules and regulations. 

Focus on decision making takes the study of college and university 
administration into a broad field of behavioral science research. Wasser- 
man and Silander (1964) annotated a bibliography on decision making 
which includes approximately 750 items published between 1958 and 
1963. Henderson (1963), in urging research on decision making in colleges 
and universities, suggested studies involving the use of various theories 
and concepts of management, leadership, and communication advocated by 
behavioral scientists and students of public and business administration. 
Corson (1960), in the course of illuminating various aspects of academic 
governance, posed and left unanswered enough questions to occupy a 
generation of researchers. 

Contemporary university administration is definitely different from past 
administration. Rourke and Brooks (1964) called it the “managerial revo- 
lution”—the trend toward scientific management illustrated by organized 
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institutional research, increasing use of quantitative aee 
analysis, and use of tes Steg ba ipea P s bend 
i i icles devoted to ins sea 

Poe e as E The title of the collection gi 
that the methods and reports constitute a basis for decisions. Wit a 
formalization of the annual National Institutional Research ae em 
was published, under the editorship of Lins (1963), a collection: o ue 
defining the relationship of planning to institutional research: S ion 
under the editorship of Bagley (1964), a set of papers was publis 


s um - institu- 
representing attempts to establish a conceptual framework for instit 
tional research. 


Administrators—Role, Selection, and Training 


College and university administrators still remain untouchable as 1 
jects for systematic research on role perception and conflict, persona it) 
characteristics, value orientation, status-seeking behavior, and identifica- 
tion with institution. Published works about administr 1 
there is no dearth, are mainly anecdotal or advisory. The more rip 
statements attempt to cover such matters as style; leadership: duties an ‘ 
responsibilities ; relationships with administrative colleagues, faculty, en 
dents, governing board, and the public; and selection and training. Do d 
(1962) and Prator (1963) focused on the presidency; Corson (1960) an ; 
à group of contributors to a book edited by Burns (1962), on a wide mv 
of administrators ranging from the president to departmental chairmen; 


and Mobberley and Wicke (1962), on the deanship of the liberal arts 
college. 


Bolman (1964) review. 


ators, of which 


NP, e s t 
ed programs for training administrators, aM 
of which have developed in recent years. He detected two major flaw 


in current efforts: lack of coordination and lack of evaluation. 
The resistance of administrator: 


sometimes been overcome. Gould (1964), 


in his study of the academic 
deanship, 


d questionnaires from almost 
ns in the 50 states. In addition 


of college president 
of nonparochial four-year col- 


ppointed to their positions during 
1959-62. Confidential interviews were held with more than 100 persons, 


among them members of governing boards, faculties, alumni, and a 
ministrative staffs. The report contains procedural suggestions on p 
aspects of the selection process as launching the search, the faculty's ro i 
establishing the qualifications, and finding the man. The contents are in 
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teresting, but the reader is likely to be left with the distinct impression that 
much of the data obtained were never transferred from folders marked 
"confidential" to the printed page. 


Systems—Interinstitutional Cooperation and Coordination 


Chambers’ (1961) study of voluntary statewide coordination in public 
higher education is both a description of five successful instances of such 
efforts (in Ohio, Michigan, Indiana, Colorado. and California) and a 
statement of the point of view that voluntary coordination is more con- 
sistent with the nature of the collegiate institution than is statutory 
(formal) coordination. Since the publication of Chambers’ report, how- 
ever, two of the five states have abandoned voluntary and adopted statutory 
coordination, and in one other state survival of voluntary coordination is 
in doubt. 

McConnell (1962), in his commentary on a general pattern for public 
higher education, took the point of view that coordination is inevitable. 
Anderson (1964) identified more than 80 cooperative programs relating 
to undergraduate education. The description of each program is very 
sketchy, but the list is nonetheless informative. Conant (1964) not only 
approved of statewide and regional coordination but also proposed a 50- 
state compact for the creation of an interstate commission to plan a 
nationwide educational policy. Wilson (1964) raised a number of ques- 
tions concerning form and function of American higher education and 
declared that resistance to organization and change is no longer tenable. 
Further observations regarding emerging systems in higher education— 
state systems and plans, consortia and interinstitutional arrangements, 
interstate cooperation, and national organizations for higher education— 
are presented by the American Council on Education (1961). 

The effects of the emergence and evolution of these systems on the 
autonomy and interdependence of colleges and universities have yet to be 
adequately observed, investigated, or described. There are opportunities 
here for significant research on the ecology of governance and the dy- 
namics of organizational change. 

Pertinent to the foregoing are Eckerman's (1964) study of the public 
image of state and private universities and Banfield's (1961) study of the 
politics of the decision to establish a Chicago branch of the University 
of Illinois. The Eckerman study, a national survey employing a stratified 
probability sample of 1,310 respondents, indicated that the distinction be- 
tween state and private universities is a functional one for only a minority 
of the population. 


Finance and Facilities 


Research on the finance of higher education currently is focused mainly 
on costs and productivity measures, budgetary techniques and long-range 
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fiscal planning, development programs and voluntary gifts, and a broad 
group of questions encompassed by the term "economics of higher educa- 
tion.” In the interest of brevity, the ensuing paragraphs will deal with 
only a small selection of recent studies. The governmental aspects of 
finance are covered in Chapter IX of this issue of the Review, and J. A. 
Thomas, in the October 1964 Review, discussed literature pertinent to the 
economics of higher education. Also in the interest of brevity, the annual, 
biennial, and serial reports of the U.S. Office of Education, foundations, 
and associations will generally be excluded. 

Current literature on college and university facilities indicates con- 
siderable interest in campus planning and the year-round use of plants. 
Space utilization studies and projections of building needs, which several 
years ago were considered research, are now so common both in tech- 
nique and frequency that they are hardly worth mentioning. 


Costs and Productivity Measures 


Cost is a perennial subject of controversy 
ministrators of public colleges and universities 
if at all, and would like to av. 


in higher education. Ad- 
use cost figures cautiously. 
oid them altogether if the data may possibly 
be used for determining appropriations. There is considerable justification 


for both caution and reluctance, since comparability between and among 
institutions is difficult to attain, 


Yet such writers as H 
computed cost figures 
commenting on the fi 
Group in the Econom 


ics of Education (1964). a group of economists from 
the United States an cin 


d a number of European nations, issued a report in 
ared with the costs of 
economists is essentially national but 

purposes, approximately accurate and 
comparable figures are given university, however, sub- 


r $ in competition for funds, much greater 
accuracy and comparability are essential, We may expect the ground rules 
- o 


for the computation of costs to be used for budgetary purposes to be 
quite different from those used by economists concerned with the overall 
picture. Miller (1964) provided an excellent review of cost analysis 


procedures for state colleges and universities in California, Florida. 
Indiana, Kentucky, Texas, and several other states. : 

Other data of great interest to economists w 
lion are measures of productivity. Since product 
tion and quantification of both in 


riting on higher educa- 
ivity requires identifica- 
put and output, the application of this 
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education should also be explored. especially measures generally used by 
economists. 


Long-Range Fiscal Planning 


Tickton’s (1961) manual for developing a 10-year college budget is 
a small document that has had tremendous impact on the fiscal behavior 
of college administrations. It is intended for liberal arts colleges, how- 
ever, and complex universities attempting long-range fiscal projections 
have had to deviate from the Tickton format. Hungate and Meeth (1964), 
in a study of 25 liberal arts colleges, suggested procedures for examining 
and relating certain facts to cost so that sound qualitative decisions can be 
made. They noted that long-range planning is often based on inadequate 
knowledge of quality and of ways to improve it without increasing costs. 

Tuition increases are a major concern to private colleges; for most of 
them, long-range planning requires painful attention to this matter. Harris 
(1962) and Henderson (1964) are among the writers who have addressed 
themselves to various aspects of this problem. 


Development Programs 


Just as major private universities are today public in that they are 
very dependent upon government grants for support, the stronger state 
universities are today private in that they seek sizeable gifts and grants 
from private sources. There is some speculation but as yet no research on 
what effects, if any, success of such development programs might have 
on legislative attitudes toward appropriations. 

A study of 278 college capital campaigns, published by the Council for 
Financial Aid to Education (1964a), summarized data on goals and 
amounts raised by various types of institutions and by geographic areas. 
There was also an appraisal of short-of-goal campaigns. 

The Council for Financial Aid to Education and the American Alumni 
Council are the two major sponsors of surveys on voluntary support of 
colleges and universities. The data, however, are difficult to evaluate. One 
factor is the low rate of return. For example, the latest available survey 
report of the Council for Financial Aid to Education (1964b) showed 
that only 53 percent of the 1,948 institutions invited to take part had 
responded. Another factor is the variation in the number of institutions 
responding from one survey to the next and in the number responding 
to the surveys of the two councils. 

Venman (1962), in a study of gift annuity agreements for colleges, 
obtained information from 160 liberal arts colleges having such agreement 
programs. He concluded that colleges do not use such agreements to their 
best advantage and that it should not be difficult to establish a safe annuity 
program if a college understands all aspects of the program. 
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Year-Round Use of Physical Plant 


What had at first seemed like a simple proposal—to ardore Vi ie 
students in a given plant by adopting a year-round school calend ar— ba 
become a subject of much debate and study. No attempt will be ma 
here to unravel the tangled skeins, to separate theory from practice an 
hope from reality. T 

An inventory of data on year-round plans at 40 colleges and universi án 
was made by Tickton (1963). Stickler and Carothers (1963). distan 
the status, trends, and problems of the year-round calendar, giving specia 
attention to its financial implications. Webb (1963) discussed such systems 
with reference to Canadian colleges and universities. The problems and 


prospects of year-round education from kindergarten to college were 
examined by Schoenfeld and Schmitz (1964). 


Campus Planning 


The several meanings and many forms of campus planning were identi- 
fied and evaluated by Dober (1963) in the most comprehensive work to 
date on campus planning. He illuminated through historical examples the 
evolution of the campus as a design form, described the factors that make 
campus planning an important activity, commented on the functional and 
aesthetic aspects of the physical parts of the campus, and indicated in 
detail the procedures for preparing campus plans. 

Information on what is happening in four major types of campus 


buildings—classroom, laboratory, library, and dormitory—was brought 
together and interpreted in a report prepared by the Educational Facilities 
Laboratories (1964). Riker and Lopez (1961), in their study of college 
housing, presented a 


number of interesting tabulations on costs of operat- 
ing dormitories, 


1 Space requirement standards, and projections of needs. 
Zisman and Powell (1962) identified the problems involved in moving à 


college from an old campus to a new one, basing their observations on à 
case study of four colleges that undertook such a move, 


cussed by Weinstock (1961) ina rep 
of Technology. Lacy (1962) compil 


r and others (1961). 
arch Center examined horizontal and 
uildings (19613), parking problems for 
fixture requirements in instructional and 
research buildings for short-term 


vertical circulation in university þ 
universities (1961b), plumbing 
research buildings (1961c), and 
programs (1961d). 
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Concluding Observations 


The organization and administration of colleges and universities are 
virgin territories for research. They have been explored in the past by 
reputable scholars and practicing administrators, but within different 
contexts from those now available. It is in the context of current frames 
of reference—the theories and concepts developed by the behavioral 
Scientists and students of public and business administration—that the 
territories are uninvestigated. Exploration may indeed show that colleges 
and universities are largely or wholly different from other organizations. 

The major barrier to the new exploration is the sensitivity, frequently 
bordering on resistance, of colleges and universities to intensive scrutiny 
of organizational values, of administrative behavior, and of patterns of 
authority, influence, and communication. 

In the area of finance, colleges and universities are more willing now to 
make data available than they have been in recent decades. This will 
greatly aid research on the economics of higher education. 

Recent research on physical facilities has paved the way for compre- 
hensive and exciting forms of campus planning. 
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CHAPTER IX 


Government and Education 


C. RUSSELL DE BURLO, JR. 


Several great battles have ensued within committees, within committee 
hearing rooms, and on the floor of Congress over the question of federal 
aid to education, since the last issue of the Review (October 1960) that 
dealt with government, administration, coordination, and financing of 
higher education. Many questions have been raised about the relationship 
of the federal and state governments to higher education. Answers have 
been sought to such questions as the following: (a) What have been 
the effects of the massive appropriations of federal funds upon the insti- 
tutions of higher learning that have received them? (b) Has the heavy 
concentration of these funds in scientific research diverted attention and 
talent from teaching functions? (c) Have these colleges and universi- 
ties found that federal dollars are followed closely by federal control? (d) 
What has been the effect on the quality of higher education at the under- 
graduate and graduate levels? Research on these questions during the past 
five years has taken the form not of controlled experimentation but of 
personal points of view, comparative clinical analyses of specific institu- 
tions, surveys of quantitative data on existing legislation, analyses or 
descriptions of events or opinions, and the like. 

Most of the literature considered whether the federal government should 
provide general aid to all higher educational institutions. The answer 
to such a question, with its broad social implications, hinges largely on 
the consensus of leaders in the federal government’s legislative and execu- 
tive branches, in colleges and universities, in educational associations, 
and among the American people at large. Too often, however, such a 
consensus has not been reached because their points of view focused on 
different levels or areas of interest. 

Significant references found during the period reviewed were concerned 
mainly with these levels and areas: (a) identification of the responsibili- 
ties of federal, state, and local governments for higher education; (b) 
identification of goals for these complex entities; (c) the composition 
of the federal government interest and the diversity of higher education: 
(d) the legislative process; (e) a topical analysis of past and present 
governmental support to higher education; (f) recommendations or pre- 
dictions for the future relationship of institutions of higher learning to 
the federal government. 

The most useful bibliographies were compiled by the American Coun- 
cil on Education (1963a) and by Little (1963). The relative absence of 
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research studies in these references indicates the need for research, re- 
gardless of methodology, in the complex and enormous field of relation- 
ships between government and higher education. 

Additional Reference: American Council on Education (1964) . 


Federal, State, and Local Governmental 
Responsibility for Education 


As indicated in Munger and Fenno (1962), federal involvement at 
the elementary and secondary levels of education generates much opposi- 
tion within and without Congress. The excellent descriptive survey by 


Quattlebaum (1960) of federal educational activities indicates by the 
virtual absence of pr 


levels that indeed the 


plicit in congressional or 
“state relations, federal control of 


mmittee on Education and Labor, 
needs of colleges and universities 
» most of her lecture at the Harvard 


^ ics o the instituti ig educa- 
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rd Institutional management areas of possible state control, of the 
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orey (1961), a highly experienced public university administrator, 
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ually on a calendar year 
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Goals of the Federal Government and of 
Higher Education 


The literature pertained overwhelmi 
higher education. Knight (1960), as 


362 


ngly to the federal government and 
editor and contributor of two of the 


October 1965 GOVERNMENT AND EDUCATION 


five chapters, dealt in sweeping prose with the purposes of colleges and 
universities and the national goals of the federal government. He con- 
cluded that our institutions of higher learning are not only the wise 
servants of our society but its advisers as well and thus must be able to 
provide wise counsel. Knight did not come to conclusions but did suggest 
five planning areas for working national groups. He emphasized that there 
is a need for coordination at the federal level and within the diverse 
higher education sector of our society. 

Babbidge and Rosenzweig (1962) found the same critical need for a 
coordinated effort by federal agencies to meet the needs of higher educa- 
tion. They questioned the federal system as a whole and whether it is 
well designed to meet higher education's needs for facilities, more and 
better trained faculty, expanded graduate instruction, and student financial 
aid. Further, they maintained that the dispersed nature of federal pro- 
grams affecting higher education tended to fragment the institutions and 
their policy-making power. Babbidge and Rosenzweig supported this con- 
clusion with an analysis of institutional decision making under the spon- 
sored research grant and contract system, the matching-grant principle 
in health and graduate research facility programs, and the various gradu- 
ate fellowship programs. They perceptively concluded that these uncoordi- 
nated federal programs shift the decision making for the allocation 
of university resources from central administrative authority to the indi- 
vidual investigator and that federal standards were applied frequently to 
the institutions. Four researchers—de Burlo (1960), Knight (1960, 
Chapter 3), Orlans (1962), and the Carnegie Foundation for the Advance- 
ment of Teaching (1963)— supported the above conclusion with details 
of administrative problems but did not concede that federal standards 
should always be accepted by the institutions receiving sponsored research 
money. 

Because of the complexity of federal programs and the diversity of 
American higher education, none of the references stipulated specific 
goals for the government or for higher education. Representative Green 
(Reuther and Green, 1964) did state the general principle that higher 
education is a matter of public policy—but so did Jefferson in 1806. The 
Carnegie Foundation (1963) stated that many persons, in both education 
and government, held the opinion that the federal government's involve- 
ment with institutions of higher education must be concerned with whole 
institutions, not just specific parts thereof. Harrington (1963) took the 
same point of view in his trenchant opinions on the issues between higher 
education and the federal government. 

To paraphrase Wilson (1963), an array of problems still face higher 
education in its relationship with the federal government. Of critical im- 
portance is the need to clarify the goals of education and to determine 
the best ways of achieving them. 

Additional References: Hannah (1961); American Council on Educa- ' 
tion (1964). 
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Composition of Federal Interest and the 
Diversity of Higher Education 


The first major research during this period was the Quattlebaum (1960) 
survey, an exhaustive compendium of federal historical background, of 
federal agencies with programs in higher education, and of the classifica- 
tion of such programs. This survey, prepared for the House Committee on 
Education and Labor, proved conclusively the opinion proffered in the 
other references that the federal government is investing enormous sums 
of money in a vast number of ventures concerned with higher education. 
In fact, this survey showed the luxuriant growth of federal funds for 
research. It is hoped that the Legislative Reference Service of the Library 
of Congress will update the Quattlebaum survey. 

Little (1963) submitted a survey, based largely on 1959 data, of federal 
Programs in higher education to the Commissioner of Education (then 
Sterling M. McMurrin), pursuant to Title X, Section 1001 (d) of the 


National Defense Education Act of 1958. The Survey was in three parts: 


programs in col- 


nalysis of the two patterns of federal involve- 
tions of higher education by 
type and by source of control. aa SM 


; and Wilson, 1963) called f leader- 

Ship by some higher educational group postali 
oup t sitive 
Proposals for federa] involvement, SA aes. Mone és 
Further, these surveys, 


» Orlans (1962), and 
eaching (1963), made 
is not a simple, single 
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dealt with bills on higher education. Munger and Fenno (1962) and 
Babbidge and Rosenzweig (1962) examined the process from different 
points of view. Munger and Fenno, from the point of view of the political 
scientist, made a perceptive and thorough political action analysis with 
regard to higher education in the Congress, in its committees, and in the 
Executive Office. Babbidge and Rosenzweig’s brief comments emphasized 
the importance of the appropriations committees and process from a point 
of view, as they entitled it, of “The Big Picture: Congress, Committees, and 
Commissions.” 

The pleas of the several years prior to 1963 for a coherent set of 
proposals from higher education for federal support were met by the 
American Council on Education’s (1963b) statement of a “broad pro- 
gram of Federal action to help expand and improve American higher 
education.” In order of priority, objectives were in the following areas: 
increasing physical facilities for instruction, research, and student hous- 
ing; increasing faculty supply and quality of instruction and research; 
lowering financial barriers to higher education for qualified students; 
paying full costs of federally sponsored research; extending the urban 
renewal program; and six others. Amounts of money were specified in 
some cases, but there was little mention made of such considerations 
as organizational methods and eligibility criteria for meeting these 
objectives. 

One wishes that Munger and Fenno would update their fascinating 
analysis to provide, with the same objectivity, the reasons for the defeat, 
despite Senator Morse’s (1963) well balanced endorsement, of the Ken- 
nedy Higher Education Bill of 1962, or, to phrase it another way, the 
political accident rating. On the other hand, Babbidge and Rosenzweig’s 
forecast of federal support for academic facilities, the first priority of 
the American Council on Education, came to fruition with the passage 
and signing by President Johnson of the Federal Higher Education 
Academic Facilities Act of 1963. In early 1965, HR 3220, four titles plus 
a definition title, was with the Committee on Education and Labor. Title 
IV of this bill is for student assistance in the form of federal scholar- 
ships flowing through the states by formulas and a federal guaranty pro- 
gram on student loans. The Academic Facilities Act also channeled the 
funds through the states on a formula basis. 

Along with the congressional committees, Munger and Fenno (1962) 
underscored the importance of monitoring associations that represent 
interest groups within the diverse elements of higher education. As 
Quattlebaum’s (1960) survey made clear by sheer numbers and as Bab- 
bidge and Rosenzweig (1962) reiterated, probably no other segment of 
American society has so many organizations and is yet so unorganized 
as higher education. However, two interest groups have had a decided 
impact on federal legislation, particularly in the past two years: the 
American Council on Education and the National Education Association. 
Munger and Fenno (1962) emphasized the role of the latter organization 
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in the legislative process. Proponents of legislation need to maximize 
support by minimizing conflicts among supporters and to marshal their 
support at critical points in the legislative process. Munger and Fenno 
provided a delightful exposition of “strategies of substance” and “strate- 
gies of approach” in the fine art of practical politics involved in advanc- 
ing a bill. 

Additional References: McGrath (1962) : Price (1962). 


Topical Breakdown and Analysis of Past and Present 
Federal Government Support to Higher Education 


The references were decidedly weighted toward federally sponsored 
research programs in colleges and universities, and all stated at some 
point that federal interest in higher education has been categorical, 
dealing with specific problem areas and not with higher education as a 
whole through general aid in the national interest, 

Federal support of university research has been the most significant 
part of the government’s relationships with American higher education 
institutions; in fact, research support means just that. With few excep- 
i ulting from such support have 


been, in government eyes, by-products rather than primary objectives. 


The government agencies with the largest 
and science—have been largely confined b 
Quattlebaum (1960) , Knight (1960), de Burlo (1960), Orlans (1962), 
(deh, en hen. ey Foundation for the Advancement of Teaching 
os ce : er and Green (1964) all made this point, with quantita- 
ail given by Quattlebaum, Orlans, Little, and the Carnegie 
Foundation. i 
nds in enormous sums have been provided on a 
venue-producing facilities 


as well as for health research facilities, as discussed carefully by Babbidge 


and Rosenzweig (1962), many millions of dollars have gone also into 
scientific research in the past 25 years, particularly since World War II, 


in an ever-increasing spiral. However, research funds tend to be dis- 
tributed in large amounts to relatively few institutions. As detailed in the 
Carnegie (1963) study’s 26 clinical analyses of higher education institu- 


tions and as supported by both the Orlans (1962) study of 36 institutions 
and one of the papers of the American Council (1963a), fewer than 500 
of the 2,000 American colleges and universities received any federally 
sponsored research money in 1962, and 80 percent of all available funds 
went to only 50 institutions in that year. The total funds in 1962 came 
to $613 million. 

Another reference for the magnitude and impact of federal funds on 
research in universities was the excellent review and position paper by 
the National Academy of Sciences (1964). The Committee on Science and 
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Public Policy of the National Academy stated its seemingly unalterable 
position in favor of research project grants and contracts as the back- 
bone of federal support of basic science research in universities. It did 
not mention intra-institutional problems, as did de Burlo (1960), Orlans 
(1962), and the Carnegie Foundation (1963). While Babbidge and 
Rosenzweig (1962) were concerned about a dispersal of policy-making 
power within institutions having substantial sponsored-research programs 
(as were de Burlo and Orlans for the administrative problems created), 
the National Academy did not go into the subject, except to state (a) that 
the acceptance of a grant by the scientist commits him to a conscientious 
effort to achieve its stated purpose and (b) that he acquires no other 
rights to the granted or contracted funds. The Academy position paper 
also stated that the administrative officer of a university, upon accepting a 
grant or contract, causes the university to be legally responsible for the 
care and proper disbursement of the government funds. The paper said 
also that administrative officers should consider the relationship of re- 
search proposals to the development of the long-range interests of their 
universities. 

Somewhat counter to this view were Chapter 3 in Knight (1960) and 
the previously cited works of Babbidge and Rosenzweig, Orlans, de Burlo, 
and the Carnegie Foundation, all of whom questioned the administrative 
officer's freedom of action to refuse a grant or contract. 

Additional Reference: American Council on Education (1964). 


Effects of Federally Sponsored Research in 
Higher Education 


The intensive comparative clinical analyses by Orlans (1962) and the 
Carnegie Foundation (1963) are certainly landmark studies of the effects 
of massive federal expenditures for research in universities and colleges. 
Orlans concluded that the effects of federally sponsored research pro- 
grams have been profound and beneficial in the sciences, noticeable but 
n the social sciences, and negligible in the humanities. 
education had a deleterious effect 
ng and reduced personal contact 
d sophomores, especially at large 


more imbalanced i 
The emphasis on graduate research and 
on the status of undergraduate teachir 
between senior faculty and freshmen an 
universities. 

Orlans and the Carnegie Foundation agreed fully with the National 
Academy of Sciences (1964) that the project system of support is vital 
to the maintenance of high professional standards and the freedom of 
the individual investigator but that broader forms of support are desir- 
able to strengthen (a) other institutions that offer the doctorate and are 
not in the top 50 recipients of present support as well as (b) neglected 
scientific and educational areas. ` 

Additional Reference: American Council on Education (1964). 


367 


REVIEW oF EDUCATIONAL RESEARCH Volume XXXV, No. 4 


Future Relationship Between Institutions of Higher 
Education and the Federal Government 


Certain trends have been discerned by Babbidge and Rosenzweig 
(1962). These were that programs that were narrowly conceived will 
become more general, that programs will broaden their institutional 
coverage of graduate fellowships, that federal programs will broaden 
vertically by widening the focus from graduate schools to embrace under- 
graduate education as well, and that the social sciences will get increas- 
ing support. Present signs indicate that these trends will continue to 
develop. It will probably also come to pass as broader financial commit- 
ments are made, such as an extension of the National Science Foundation 
or the National Institutes of Health general research grants, that the 
federal government will come very close to general aid to higher educa- 
tion. In this event, the issue of qualitative judgments on colleges and 
universities by federal agencies, even with the assistance of academic 
panels, will undoubtedly create controversy. Research is badly needed 
for various types of institutional evaluation. 


The administrative staffs of both universities and federal agencies will 
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